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Galacturonosyltransferases (GalATs) are required for the synthesis 
of pectin, a family of complex polysaccharides present in the cell 
walls of all land plants. We report the identification of a pectin 
GalAT (GAUT1) using peptide sequences obtained from Arabidop- 
sis ttialiana proteins partially purified for homogalacturonan (HG) 
a-1,4-GalAT activity. Transient expression of GAUT1 cDNA in the 
human embryonic l<idney cell line HEK293 yielded uridine diphos- 
phogalacturonic acidiGalAT activity. Polyclonal antibodies gener- 
ated against GAUT1 immunoabsorbed HG a-1,4-GalAT activity 
from Arabidopsis solubilized membrane proteins, blast analysis of 
the Arabidopsis genome identified a family of 25 genes with high 
sequence similarity to GAUT1 and homologous genes in other 
dicots, in rice, and in Physcomitrella. Sequence alignment and 
phylogenetic Bayesian analysis of the Arabidopsis GAUT1 -related 
gene family separates them into four related clades of GAUT and 
GAUT-like genes that are distinct from the other Arabidopsis 
members of glycosyltransferase family 8. The identification of 
GAUT1 as a HG GalAT and of the GAUT1-related gene family 
provides the genetic and biochemical tools required to study the 
function of these genes in pectin synthesis. 
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pectic polysaccharide 

Pectins are structurally complex plant cell-wall polysaccha- 
rides that contain 1,4-linked a-D-galactopyranosyluronic 
acid residues. Galacturonic acid (GalA) is the most abundant 
glycosyl residue in liie liu'ee types of pectin present in all plant 
primary walls: liomogalacturonan (HG), rhamnogalacturonan I 
(RG-I), and rhamnogalacturonan II (RG-II) (1). Pectin ac- 
counts for =35% of dicot and nongraminaceous monocot pri- 
mary walls and '^10% of the primary wall of grasses (2). Pectins 
are also present in the walls of gymnosperms, pteridophytes, and 
bryophytes as well as Chara, a charophycean algae that is 
believed to be the closest extant relative of land plants (3). 

Numerous studies show that pectins contribute to the physical 
and biochemical properties of the wall (1) and are required for 
normal plant growth and development (4). A complete under- 
standing of pectin function requires knowledge of pectin bio- 
synthetic enzymes and their corresponding genes. Although the 
enzyme activities of proteins encoded by some pectin biosyn- 
thetic genes, particularly those involved in the synthesis of sugar 
nucleotides, have been elucidated (5-9), the activities of putative 
pectin biosynthetic glycosyltransferases [e.g., QUAl (10) and 
NpGUTl (11)] have not been definitively demonstrated. 

HG is apolymer of a-l,4-linked GalA that accotmts for '»65% 
of pectin. a-l,4-Galacturonosyltransferase (GalAT) activity has 
been identified in numerous plants (12) and shown to be 
membrane -bound in all species studied. Work in pea localized 
HG:GalAT activity to the luminal side of Golgi vesicles (13), the 
same location as pectin synthesis (14, 15). The most extensive 
study of HG:GalAT activity was done in tobacco (16, 17), and 



characteristics of the tobacco enzyme are comparable with the 
activity described in other plant species. 

In vitro, GalAT transfers GalA from uridine diphospho-GalA 
(UDP-GalA) onto endogenous acceptors to produce pectic 
products with molecular masses of 100 kDa to >500 kDa (13, 
16). Establishment of conditions to recover detergent- 
solubilized GalAT activity from membrane fractions (17) and in 
vitro studies using radiolabeled substrate (17) or fluorescently 
tagged (18, 19) acceptors established that, in vitro, GalAT 
preferentially transfers GalA onto the nonreducing end (20) of 
HG oligosaccharide acceptors [oligogalacturonides (OGAs)] of 
a degree of polymerization (DP) >9 (17, 18), although OGA 
acceptors as small as a trimer can be used (18, 19). Polymeric 
pectins, such as poly-GalA and pectin, are less favorable sub- 
strates (21). Membrane-permeabilized GalAT activity from 
pumpkin yielded a population (rf OGAs elongated by up to five 
galacturonosyl residues (19), whereas the solubilized petunia 
enzyme added up to 27 galacturonosyl residues onto the OGA 
acceptors (18). Clarification of the mode of action of GalAT(s) 
and the mechanism of HG synthesis requires access to purified 
or recombinantly expressed enzymc(s). 

A protein-purification approacli was lai^cn to identify an 
Arabidopsis tfialiana gene encoding llG:GalAr, because no gene 
encoding an enzymatically verified GalAT has been identified in 
any organism. GalAT was partially purified from Arabidopsis 
suspension-cultured cells and bioinformatics together with pep- 
tide sequence data were used to identify two putative GalATs. 
Functional characterization of the protein encoded by one of 
these genes (At3g61130), by using numerous biochemical 
methods, provides compelling evidence that this protein is a 
HG:GalAT (16). We therefore named this gene galacturonosyl- 
transferase 1 (GAUTl). 

Results and Discussion 

Identification of GAUTl. Solubilized membrane proteins were 
isolated from log-phase suspension-cultured A. thaliana cells 
(22). GalAT activity was partially purified by sequential SP- 
Sepharose, Reactive yellow 3, and repetitive UDP-agarose chro- 
matography (see lane 5 in Fig. 4, and Table 1, which are 
published as supporting information on the PNAS web site). The 
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GalAT fraction was treated with trypsin, and resulting peptides 
were sequenced by liquid chromatography tandem MS. 1 he 
peptide sequences obtained were screened against the Arahi- 
dopsis genome, revealing two related proteins (JS33 and JS36) 
that correspond to coding sequences At2g38650 and At3g61 130, 
respectively (see Fig. 5A and Table 2, which are published as 
supporting information on the PNAS web site). Both sequences 
are predicted to encode glycosyltransferases with type II mem- 
brane topology (23), C-terminal catalytic domains with consen- 
sus sequences clustered in CAZy glycosyltransferase family 8 
(24), and to have basic isoelectric points. These characteristics 
are consistent with the known in vitro biochemical properties ol 
GalATs (17, 20). The predicted catalytic domains also contain 
DxD motifs, a conserved motif present in many well character- 
ized glycosyltransferase families (25) and involved in the coor- 
dination of divalent cations (26). RT-PCR showed that tran- 
scripts from both genes are present inArabidopsis flowers, roots, 
stems, and leaves (see Fig. 6, which is published as supporting 
information on the PNAS web site). 

cDNA constructs containing N-terminally truncated versions 
of JS33 (JS33A1-43) and JS36 (JS36A1-41) downstream of a 
Trypanosoma cruzi mannosidase signal sequence, a polyhistidine 
tag, and two copies of a hemagglutinin (HA) epitope in the 
mammalian expression vector pEAKlO were generated (Fig. 5B) 
and used to transiently transfect human embryonic kidney 
(HEK)293 cells. Growth media from transiently transfected cells 
were treated with anti-HA antibodies bound to protein A- 
Sepharose. The protein immunoprecipitated from the JS36A1- 
41-transfected cells catalyzed the incorporation of [i-'CJGalA 
from UDP-["C]GalA onto HG OGA acceptors (Fig. hi). No 
GalAT activity was recovered from immunoprecipitates of the 
.TS33A 1 -43 or empty vector-transfected cells. These results pro- 
vided evidence that JS36 encoded a GalAT that catalyzes the 
iranslcr of GalA from UDP-GalA onto HG, thus, we named 
.IS36 CiAUTl. 

Immunoabsorption of GalAT Activity from Partially Purified Arabi- 

dopsis SP-Fraction by Using GAUT1 Antiserum. To confirm that 

(lAU'i'i was involved in HG synthesis, GAUTl antiserum was 
produced and used in an immunoprecipitation assay to deplete 
( lalA'i' activity present in partially purified protein preparations 
from Arabidopsis suspension-cultured cells. The GAUTl anti- 
serum contained antibodies that recognized native (by immu- 
noabsorption) and denatured (by Western analysis) recombi- 
nantly expressed truncated GAUTl (see Fig. 7, which is 
published as supporting information on the PNAS web site) and 
an «=60-kDa protein present in the partially Tpmiiied Arabidopsis 
SP-Sepharose fraction. The GAUTl antiserum immunode- 
pleted GalAT activity from the Arabidopsis SP-Sepharose frac- 
tion (Fig. IB, open diamonds). Furthermore, GalAT activity was 
immunoabsorbed by the conjugated GAUTl antiserum in a 
dose-dependent manner (Fig. IB black boxes), confirming that 
GAUTl is a GalAT. Western analysis of the depleted fraction 
using increasing amounts of GAUTl antiserum (see Fig. %A, 
which is published as supporting information on the PNAS web 
site) demonstrated the progressive disappearance of the 60-kDa 
protein band from the SP-Sepharose fraction. Conversely, the 
60-kDa band appeared in the immunoabsorbed fractions with 
increasing amounts of GAUTl antiserum (Fig. 85). 

The ability of anti-GAUTl immunoprecipitates to elongate 
OGAs was established by testing the sensitivity of the OGA 
products to cleavage by a HG-specific exopolygalacturonase 
(EPG) (Fig. 2 A and B). The bulk (98.9%) of the product 
synthesized by the anti-GAUTl immunoprecipitates was cleaved 
by treatment with EPG (Fig. 2A, compare columns 7 and 8 with 
column 9), demonstrating that the synthesized products are HG. 
Furthermore, polyacrylamide-gel electrophoresis of the prod- 
ucts generated by reacting anti-GAUTl immimoprecipitate with 



Positive Oonirol 



Backgraund Vsictoi Conycl JS33 



f;;ofG.AUT1 antiserum 

Fig. 1. GAUTl has GalAT activity. (A) GalAT activity in equal amounts of 
media from HEK293 cells transiently transfected with the JS33A1-43 (JS33) or 
JS36A1-41 (JS36) cDNA constructs or empty vector control and Immunoab- 
sorlaed with anti-HA antibodies conjugated to protein A-Sepharose. Solubi- 
llzed Arabidopsis protein with GalAT activity (inset) was used as a positive 
control. The average of time-0 reactions (background) is shown. Data are the 
average (±SE) of duplicate samples from 60-min reactions. Data were ana- 
lyzed for significance compared with empty vector controls by using a two- 
sample, one-tailed Student's ttest and an a priori a of 0.05 [95% confidence; 
tjS36 = 10.284 (P ~ 0.005); tjsss = 0.180 (P > 0.25)]. Comparable results were 
obtained in three separate experiments, (fi) Immunoabsorption of GalAT 
activity from Arabidopsis solubilized membrane proteins by using GAUTl 
antiserum. Solubilized Arabidopsis membrane proteins partially purified by 
SP-Sepharose chromatography were incubated with increasing amounts of 
GAUTl antiserum-coated Dynabeads, and proteins immunoabsorbed by anti- 
GAUTl antibodies were magnetically separated from the SP-Sepharose frac- 
tion. GalAT activity was measured in both anti-GAUTl immunoabsorbed 
(blacl< boxes) and depleted (open diamonds) fractions and compared with 
similar fractions obtained by using preimmune serum (Inset). Open bar, 
preimmune serum immunodepleted fraction; filled bar, preimmune serum 
immunoabsorbed fraction. 



UDP-GalA and trideca-GalA revealed the appearance of OGAs 
of increasing size (14-mer to 21-mer) in a time-dependent 
fashion (compare lanes 10-12 in Fig. 2B). Conversely, only a 
slight increase in the size of the OGAs was observed in the 
immunodepleted fraction (Fig. 2B, lanes 7 and 8). The sensitivity 
of the elongated OGAs made by anti-GAUTl immunoprecipi- 
tates to treatment with EPG (Fig. 2B, lane 13) clearly demon- 
strates that the GAUTl antiserum immunoabsorbs HG:GalAT 
activity. 

GAUTl and JS33 Are Part of a Multlgene Family in Arabidopsis. BLAST 
analysis of GAUTl amino acid coding region against the Ara- 
bidopsis genome identified JS33 and 13 additional coding se- 
quences with 36 -68% amino acid sequence identity and 56-84% 
sequence similarity to GAUTl (genes GAUT 1-15; see Table 3, 
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Fig. 2. Characterization of products made by anti-GAUTl immunoabsorbed 
protein. (A) Sensitivity of synthesized product to cleavage by EPG. Products 
synthesized during 2-h reactions containing buffer, UDP-p''C]GalA, OGA ac- 
ceptors, and theArabidopsis SP-Sepharose fraction, the SP-Sepharose fraction 
after immunodepletion of GAUT1 (GAUTl depleted), or the anti-GAUTl 
immunoabsorbed material (GAUTl enriched). Each fraction was incubated 
overnight with water, boiled EPG, or native EPG. Radiolabeled products 
recovered by using the filter assay are shown. Note that small oligomers (e.g., 
monomer and dimer) do not bind to the filters, (fi) Separation of nonradio- 
active GalAT reaction products by electrophoresis on a 30% polyacrylamide 
gel. Lanes 2-4 show the products recovered after incubation of SP-Sepharose 
purified solubilizedArab/dops/s membrane proteins with UDP-GalAand OGAs 
enriched for a DP 1 3 for 0, 3, and 22 h at 30°C. Lanes 6-8 show the same series 
of incubation times with the SP-Sepharose fraction after immunodepletion 
with GAUTl antiserum. Lanes 10-12 show similar series with the product 
synthesized by GAUTl-immunoabsorbed protein from the SP-Sepharose frac- 
tion. Lanes 5, 9, and 13 show product recovered after digestion of the 
respective 22-h reaction products with EPG. Lane 1 shows 0.1 f^g of OGA 
standard of DP 7-23. 



which is published as supporting information on the PNAS web 
site). An additional 10 sequences with reduced sequence identity 
to GAUTl (23-29% identity and 42-53% similarity) were also 
identified and named the GalAT-like (GATL) genes (GATL 
1-10). We suggest that all 25 Arabidopsis genes represent putative 
GalATs and have grouped them into a GAUTl-related gene 
superfamily (Table 3). 

Sequence alignment of the 25 proteins by using clustalx (27) 
revealed a high degree of alignment (see the first 25 entries in 
Fig. 9, which is published as supporting information on the PNAS 
web site) and the presence of several highly conserved domains 
(Fig. 3A) in addition to the DxD motif (25). The conserved 
GAUTl-related superfamily motif [H-[FWY]-[DNS]-G-x (2)-K- 
P-W-x (2)-[ILM]-[ADGS] is unique to the entire GAUTl- 
related superfamily. TheArabidopsis GAUTl-related superfam- 
ily members and orthologous proteins from rice, tobacco, and 



chickpea were the only sequences identified when this motif was 
used in a blast search of the SWISSPROT, TrEMBL, and BDP 
databases. Amino acid motifs were also identified that specifi- 
cally discriminate the GAUT and GATL subclasses within the 
GAUTl-related superfamily. The GAUT family motif (H-x 
(2)-[ILV]-x-[ST]-D-N-[IV]-[IL]-A-[ASTV]-S-V-V-[AIV]-x-S-x- 
[AIV]-x (2)-[AS]-x (2)-[PS]-x (3)-V-[FL]-H-[ILV]-[ILV]-T- 
[DN]-x (2)-[NST]-x (2)-[AGP]-[IM]-x (3)-F) identifies the 15 
Arabidopsis GAUT genes and their orthologues in other plants. 
The GATL motif W-M-x-[ILM]-Q-x (3, 4)-R-I-Y-[DEH]-L-G- 
S-L-P-P-F-L-L-[IV]-F-[AGS]-G-x-[IV]-x-[AP]-[IV]-[DENS]- 
H-[QR]-W-N-Q-H-G-L-G-G-D-N-[FILV]-x-[GS]-x-C-R identi- 
fies the 10 Arabidopsis GATL genes and their orthologues. 

GAUT genes are all predicted to encode proteins with mo- 
lecular masses between 61 and 78 kDa (Table 3). Most of the 
GAUT geaes are likely to encode type II membrane proteins (23) 
that contain a putative transmembrane domain in their hyper- 
variable N-terminal region (Fig. 3^4). In agreement with this 
predicted topology, GAUTl antiserum immunoabsorbs 
HG:GalAT activity, and such activity has been shown in pea to 
localize to the Golgi lumen (13). Furthermore, Arabidopsis 
GAUTl protein has been localized to the Golgi (28). Three of 
the GAUT proteins (GAUT 3, 4, and 5) contain an N-terminal 
signal peptide rather than a transmembrane domain (Table 3). 
GAUT2 is the only member of the GAUT family that is 
predicted to contain no N-terminal transmembrane domain and 
no signal peptide. The GATL proteins have molecular masses 
between 39 and 44 kDa and are predicted to contain only a signal 
pcplidc ;il iheir N termini, suggesting that these proteins are not 
inlcgial membrane proteins but are processed into the secretory 
pathway, a location consistent with a function in pectin synthesis. 

Ihe 25 proteins in the GAUTl-related superfamily belong to 
CAZy glycosyltransferase family 8 (24), which contains a total of 
40 Arabidopsis proteins. We suggest that a splitting of CAZy 
family 8 be considered, based on functional and sequence-based 
analyses. Many of the current family-8 proteins are functionally 
annotated as being galactosyl- or glucosyltransferases, activities 
that are clearly distinct from the GalAT activity identified for 
GAUTl and attributed to the other GAUTl-related proteins 
discussed here. Multiple sequence alignments of theArabidopsis 
family-8 proteins revealed only a single region of sequence 
similarity across all 40 Arabidopsis proteins centered about the 
DxD motif. Even the DxD region common to the 40 Arabidopsis 
family-8 proteins contains differences that permit a distinction 
between the GAUTl-related family and the other family-8 
proteins. For example, the GAUTl-related genes have a D- 
[DHS]-DxxxxxD motif, whereas the equivalent motif in the other 
family-8 proteins is D-[AG]-D that lacks the final D (Fig. 9). 
Furthermore, members of the GAUTl-related family have 
conserved amino acid motifs not found in any of the other 
Arabidopsis family-8 proteins (Fig. 3A). We believe that splitting 
family 8 as we suggest allows for evolutionarily more meaningful 
comparisons among this group of proteins. 

Phylogenetic Analysis of the GAUTl -Related Superfamily. Phyloge- 
netic analysis of the 25 GAUTl-related superfamily members 
show that they cluster into four distinct clades, with highly 
significant clade credibility values (Fig. 3B). The GAUT proteins 
cluster into three clades: GAUT-A (GAUTl-7), GAUT-B 
(GAUT8-12), and GAUT-C (GAUT12-15). All members of the 
GATL family cluster tightly into a distinct clade that is most 
closely related to GAUT15. 

Our phylogenetic analysis of these proteins differs in signifi- 
cant respects from a published analysis (29), carried out by using 
CLi'STALW, particularly with respect to the phylogeny of the 
GAUT proteins. Our Bayesian analysis was restricted to the 25 
GAUTl-related superfamily of proteins and included two pro- 
teins [GAUTll (Atlgl8580) and GAUT5 (At2g30575)] not 
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Fig. 3. Characterization of the Arabidopsis GAUT1- 
related gene superfamily. (A) Schematic representation 
of domain structure and conserved amino acid motifs in 
the Arabidopsis GAUT and GATL proteins. Positions of 
conserved residues found in the GAUT family are num- 
bered relative to GAUT1. Residues 22-44 represent a 
predicted transmembrane region. The positions of con- 
served residues in the GATL family are numbered rela- 
tive to GATL1. The conserved amino acid residues 
within the motifs diagnostic for the GAUT1 -related 
superfamily and the GAUT and GATL subfamilies are 
shown. (S) Phylogenetic analysis of the GAUT1 -related 
superfamily in A thaliana. Alignment of the complete 
sequences of all 25 members of the GAUT1 -related 
superfamily was carried out with clustalx (27) using 
parameters suggested by Hall (44). Bayesian analysis 
employing the program mrbayes (45, 46) was used to 
infer phylogenetic relationships among the members 
of the superfamily and to group the protein sequences 
into related clades. The analysis was carried out for 
100,000 generations using a mixture of amino acid 
transition parameter models. The phylogram pre- 
sented is the majority rule tree. Percentage clade cred- 
ibility values for each branch are given in parentheses. 



included previously. Our analysis also specifically excluded those 
family-8 proteins that show no substantial regions of sequence 
alignment with GAUTl. Bayesian analysis yields a different set 
of clades within the GAUT subfamily of proteins than observed 
previously. QUAl and GAUTl fall into clearly distinct clades in 
our analysis, whereas they had been grouped previously into the 
same clade. Functional characterization of additional members 
of the GAUTl-related superfamily should resolve which of the 
two models best describe the evolutionary relationships among 
these proteins. 

A BLAST search of the National Center for Biotechnology 
Information (NCBI) (www.ncbi.nlm.nih.gov/entrez/query.fcgi) 
and TIGR (www.tigr.org) databases indicates the presence of 
several protein sequences from other plant species with signif- 
icant identity to GAUTl. Phylogenetic analysis of available 
sequences from multiple species demonstrates that proteins with 
the most similarity to GAUTl are present in soybean 
(TC207293), barrel medic (TC108344), tomato (TC154813), 
maize (TC272016), chickpea (CAB81547), and Physcomitrella 
(Contig3305). A search of the rice genome also reveals multiple 
proteins with high sequence similarity to members of the 
GAUTl-related superfamily. For example, the rice proteins 
Q652K2 and Q5Z7Z8 appear to be orthologues of GAUTl. 



Other gene sequences from rice (e.g., BAC06990) cluster into 
specific clades within the Arabidopsis phylogenetic tree. These 
results strongly suggest that GAUTl and other members of the 
GAUTl-related superfamily are highly conserved in vascular 
and nonvascular plants. 

Proposed Role in Pectin Syntliesis. Pectin consists of HG, RG-I, 

RG-II, and, in some tissues and species, xylogalacturonan and 
apiogalacturonan (1). The precise nature of the linkages between 
these pectic polysaccharides in the wall remains controversial 
(30), although the available evidence supports a linkage via the 
backbones of the polymers (31). Clearly, the synthesis of pectin 
requires the coordinated action of numerous GalATs (at 
least one HG:GalAT, one RG-I:GalAT, and two to three 
RG-II:GalATs) (12). We propose that multiple members of the 
G^OTi -related gene family encode GalATs involved in differ- 
ent aspects of pectin synthesis. Consistent with this hypothesis, 
a survey of ihe Arabidopsis massively parallel signature sequenc- 
ing (MPSS) database, the Arabidopsis microarray database 
GENEVESTIGATOR (www.genevestigator.ethz.ch) (32), and 
the whole-genome arrays of Arabidopsis (33) show that most 
members of the GalAT superfamily are expressed, at varying 
levels, in all of the major tissues of Arabidopsis (see Table 4, 
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which is published as supporting information on the PNAS 

Our recent analysis of walls from homozygous mutants of 12 
members of the GAUTl -related gene family shows that mutants 
in 9 of the genes have significant reductions in the amount of 
GalA in their walls (K. Caffall and D.M., unpublished data), 
providing support for a function of other G/4Lrn -related genes 
as GalATs. Furthermore, plants carrying mutations in two 
members of the Arabidopsis GAUTl-ielated superfamily (gautS/ 
qual) and (gatll /parvus /glzl) have been described in refs. 10, 29, 
34, and 35. 

Qual mutant plants are dwarfed and have reduced cell 
adhesion and 25% reduced levels of GalA in their leaves. 
However, the pleiotropic effects of the mutation on glycosyl- 
transferase activities make it difficult to specify the enzymatic 
function of the mutated gene (35). Our studies show that 
GAUT8/0UA1 is a member of the GAUTl-related superfamily 
and support the argument that GAUT8/QUA1 functions as a 
GalAT. Definitive proof will require expression of enzyme 
activity. 

The parvus/glzl mutants are semisterile dwarf mutants. The 
neutral sugar compositions of parvus/glzl walls differ from 
wild-type walls in glycosyl residues found in RG-I. Because the 
levels of GalA in the parvus/glzl walls were not determined, it 
is not known whether parvus/glzl walls are altered in GalA 
content. Nevertheless, the phenotypes of plants carrying a 
mutation in the GATLl /PARVUS /GLZ and GAUT8/QUA1 
genes, together with the location of these proteins in different 
clades of the GAUTl-related superfamily, support the argument 
that GAUT8/QUA1 and GATLl/PARVUS/GLZ function as 
putative GalATs involved in pectin synthesis. 

1 he identification of GAUTl as a IKJ GalAT provides the 
molecular tools required to elucidate the biochemical mccha- 
nism(s) of I Kj and pectin synthesis. Biochemical and functional 
studies of GAUTl and other members of the GAUTl-related 
gene family are expected to increase our understanding of the 
biological roles of pectin in plants and may lead to enhanced 
agricultural productivity and developn^ent of pectin-based phar- 
maceuticals and industrial polymers. 

Materials and Methods 

For a detailed version of this section, see Supporting Materials 
and Methods, which is published as supporting information on 
the PNAS web site. 

Plant and Mammalian Cell Cultures. Cell-suspension cultures of A. 

thaliana (cv. Columbia) (22) were collected during exponential 
growth by filtration, washed extensively with water, and stored at 
-80°C until use. HEK293 cells (Edge Biosystems, Gaithersburg, 
MD) were grown in bicarbonate-buffered, Dulbecco's modified 
Eagle's medium (Sigma), pH 7.4, containing glucose (4.5 g/liter) 
and supplemented with 10% (vol/vol) FBS (Sigma), 0.6 g/liter 
L-glutamine, 100 /xg/ml streptomycin sulfate, and 100 units/ml 
penicillin. 

Preparation of SP-Sepharose-Purified Arabidopsis Membrane Pro- 
teins. Detergent-solubilized membrane proteins from suspen- 
sion-cultured Arabidopsis cells (22) were loaded onto a SP- 
Sepharose column and bound proteins eluted by using a NaCl 
step gradient (see Supporting Materials and Methods for details). 
Proteins eluting with 300 and 400 mM NaCl were pooled, 
desalted, and stored at -80°C until use. 

Partial Purification of GAUT1 and Identification by Liquid Chromatog- 
raphy Tandem MS. Desalted SP-Sepharose fraction was fraction- 
ated over Reactive yellow 3 and UDP-agarose columns, and the 
fraction most enriched for GalAT activity (see Supporting Ma- 
terials and Methods and Table 1 for details) was treated with 



sequencing grade, modified trypsin (Promega). Resulting pep- 
tides were dialyzed overnight and analyzed by using a 0-TOF2 
(Waters Micromass, Milford, MA) tandem mass spectrometer 
with a Waters CapLC delivery system. Survey MS spectra, 
acquired from 450-1,700 mass units, were used to identify 
peptide sequences. Sequences were probed against the Arabi- 
dopsis genome and proteins identified by using the Mascot 
search engine (www.matrixscience.com) and a peptide and tan- 
dem MS tolerances of ±2 and ±1, respectively. 

Cloning of JS33 (GAUT7) and JS36 (GAUT1). A fuU-length cDNA 
encoding JS33 (GenBank accession no. AY091448) in vector 
pUNI51 was obtained from the Arabidopsis Biological Resource 
Center (ABRC, Columbus, Ohio). N-terminally truncated JS33 
(JS33A1-43) lacking the putative transmembrane domain was 
generated by PCR using the sense primer FJS33Ntru (5'- 
ttattacccggggaattccacaatggctttcactctcctggattt-3') and the anti- 
sense primer RpUNI51 (5'-ggcggctatgatctcggcggccgctagaattg- 
3'). The primers contained restriction sites for Smal and NotI, 
respectively (underscored in the primer sequences). 

A version of JS36 (GenBank accession no. NM_1 15977) 
lacking its putative N-terminal transmembrane domain was 
generated (JS36A1-41) from total flower RNA by using RT-PCR 
and the gene-specific sense primer FJS36Ntru (5'-ttatta cccggg- 
gaattccgaggagtgtatatcgattcctcaaatg-3') and antisense primer 
RJS36 (5'-gcggccgcattttattcatgaaggttgcaacgacg-3'). The primers 
contained restriction sites for Smal and NotI, respectively (un- 
derscored in the primer sequences). 

Expression of JS33 and JS36 in HEK293 Cells. Purified PCR products 
for truncated JS33 and JS36 were cloned into pCR2.1-TOPO 

(Invitrogen), and Smal/NotI fragments were subcloned into the 
mammalian expression vector pEAKlO (Edge Biosystems) along 
with an N-terminal Hindlll/Smal fragment containing a T. cruzi 
mannosidase signal sequence, a polyhistidine tag, and two copies 
of the HA epitope (36). HEK293 cells (37) transiently trans- 
fected with 25-30 ixg of DNA were incubated at 37°C for 48 h 
before harvesting. 

Production of Polyclonal Antisera. Soluble, tetrameric multiple 
antigenic peptides corresponding to JS36 (GAUTl) amino acid 
sequences 448-472 (AMREYYFKADHPTSGSSNLKYRNPK) 
coupled to a trilysine core were synthesized at the Molecular 
Genetics Instrument Facility at the University of Georgia and 
used to produce polyclonal antibodies. Two rabbits (New Zea- 
land White) were immunized (0.5 ml of 0.3 mg'ml"^ antigen), 
maintained and bled by the Polyclonal Antibody Facility at the 
University of Georgia. 

Magnetic Bead Immunoabsorption of GalAT Activity. Solubilized 
SP-Sepharose -purified y4rafe'rfopsi$ membrane proteins containing 
GalAT activity were used for immunoprecipitation experiments 
(see Supporting Material and Methods for details). M-280 sheep 
anti-rabbit IgG-coupled Dynabeads (Dynal Biotech, Lake Success, 
NY) in PBS (6-7 X 10* beads per ml) were incubated for 2 h at 4°C 
with undiluted anti-GAUTl antiserum at a ratio of 3:1 (vol/vol). 
Conjugate was washed and mixed with Arabidopsis SP-Sepharose 
fraction (0-160 /xl of antiserum per 240 /xl of SP-Sepharose 
fraction) by rotation for 2 h at 4°C. Washed immunoprecipitates 
and anti-GAUTl-depleted supematants were analyzed by Western 
analysis and by using a GalAT filter assay (38). GalAT reactions 
containing 30 /xl of enzyme, 50 mM Hepes, pH 7.3, 0.2 M sucrose, 
0.05% (wt/vol) BSA, 25 mM KCl, 90 /xg of OGAs (DP 7-23), 6.9 
IjM UDP-D-["C]GalpA (specific activity 196 mCi-mmol-i) (1 
Ci = 37 GBq), and 1.25 mM MnCl2 in a total reaction volume of 
60 yl were incubated for 2 h at 30°C. Reactions were terminated by 
the addition of 10 /j,l of 0.4 M NaOH. 
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Exopolygalacturonase Digestion of GalAT Products. Reaction prod- 
uct (total of 70 /xl) was adjusted to pH 4.5 by the addition of 10 
/xl of 2 M acetic acid and 4.2 /xl of 1 M sodium acetate buffer, 
pH 4.2. The mixture was incubated overnight at 30°C with 2 /xl 
of water, a purified exopolygalacturonase (Aspergillus tubingensis 
EPG; EC 3.2.1.67, 0.54 mg/ml, 262 units/mg; 1 unit = 1 /xmol 
of reducing sugar produced per minute), or EPG that had been 
incubated at >95°C for 1 hour. The digestion reaction was 
terminated by addition of 23 /xl of 1 M NaOH. The final mixture 
was spotted onto cetylpyridinium chloride-coated filters and 
assayed by using the GalAT filter assay (38). 

GalAT Filter Activity Assays. UDP-D-["C]Gal;7A was synthesized as 
described in refs. 39 and 40. The GalAT activity assay was a 
modification of the procedure of ref. 16 as described in ref. 38. 

Nonradioactive GalAT PAGE Assay. The elongation of OGA accep- 
tors by GaL\T in the presence of UDP-GalA was assayed by 
separating reaction products on high-percentage acrylamide gels 
(see below). Reaction mixtures (15 /xl) containing 0.33 /xg/zxl OGAs 
(DP 13), 3 mM UDP-GalA, 1.9 mM MnCh, 50 mM Hepes, pH 7.3, 
200 mM sucrose, 25 mM KCl, 0.05% BSA, and 5 /xl of enzyme was 
incubated at 30°C for 3 h (unless otherwise indicated). Reactions 
were terminated by the addition of 700 /xl of chloroform/methanol 
(3:2) with vortexing and the mixture centrifuged for 5 min at 13,200 
X g. The supernatant was removed and the pellet resuspended in 
500 /xl of 65% ethanol by using a combination of vortexing and 
sonication. The mixture was centrifuged for 5 min at 13,200 X g and 
the supernatant removed. The pellet was air-dried for 5 min, 
resuspended in 50 /xl of sterile H2O, and either frozen at -20°C or 
analyzed directly. 



Samples were separated by PAGE and visualized by alcian 
blue/silver nitrate staining using a modification of the procedures 
of refs. 41 and 42 as described by ref. 43. Samples were mixed in a 
5:1 ratio with sample buffer [0.63 M Tris-Cl, pH 6.8, 0.05% phenol 
red, 50% (vol/vol) glycerol], loaded onto a stacking gel [5% 
(wt/vol) acrylamide, 0.64 M Tris, pH 6.8] and separated over a 30% 
acrylamide resolving gel [0.38 M Tris, pH 8.8, 30% (wt/vol) 
acrylamide (37.5:1 acrylamide/bis-acrylamide, wt/wt)] at 17.5 mA 
for 60 min. The gel was stained for 20 min with 0.2% alcian blue in 
40% ethanol and washed extensively with water (three times for 20 
seconds and twice for 10 min). The gel was incubated in 0.2% silver 
nitrate containing 0.075% formaldehyde, rinsed three times for 20 
seconds with water, and developed in 4% sodium carbonate con- 
tainijig 0.05% formaldehyde until bands appeared. The carbonate 
solution was removed, and staining was terminated by addition of 
5%' acetic acid. 
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Fig. 4. SDS/PAGE of partially punricdArabidopsis solubilizcd membrane proteins. 
Fractions (20 |ul) from the final UDP-agarose chromatography elution (see Table 1 , step 
5) were separated over a 10% SDS/PAGE gel and visualized by silver staining. Eluted 
fraction from the first UDP-agarose chromatography step is shown in lane 1; lanes 2-6 
show eluted fi-actions from the second UDP-agarose chromatography step [flow-through 
(lane 2); 50 mM UDP (lane 3); 25 mM MnCh (lane 4); 25 mM MnCb and 0.5 M NaCI 
(lane 5); 25 mM EDTA (lane 6)]. Galacturonosyltransferase activity present in each 
fraction is shown below the figure. Molecular weight markers (M) in kDa are shown on 
the left. *, The fi-action used for the identification of candidate galacturonosyltransferase 
proteins. 
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Fig, 5. Alignment of the amino acid sequences of JS33 (GAUT 7) and JS36 (GAUT 1). 
(A) The amino acid sequences of JS33 and JS36 were aligned by using CLUSTALX (1). 
The peptides identified by liquid chromatography tandem MS (LC-MS/MS) of tryptic 
peptides from the most purified GalAT-containing fraction from the UDP-agarose 
column are underlined. The arrows indicate the site of N-terminal truncation undertaken 



to remove the N-terminal and predicted transmembrane regions. The conserved DxD 
motif (2) is boxed. (B) Schematic representation of the gene constructs used in the 
expression of the JS genes in HEK293 cells. The locations of the N-terminal 
Trypanosoma cruzi mannosidase signal sequence (TCMss), the polyhistidine tag (lOx 
His-tag) and two copies of the hemagglutinin (HA)-epitope tag are indicated. 
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Fig. 6. RT-PCR expression analysis of JS33 and JS36 m Arabidopsis . Total RNA isolated 
from flowers (F), roots (R), stems (S), and rosette leaves (L) was used as a template for 
the RT-PCR of JS36 (GAUTl)- and JS33 (GAUT7)-specific transcripts. PGR products 

were separated on 1% agarose gels. The sizes of the amplified transcripts were 1,758 bp 
and 1,926 bp for JS33 and JS36, respectively. Primers directed against actin generated a 
495 -bp fragment that was used as a positive control. Positions of molecular weight 
markers (M) in kb are shown on the left. 



a: .. . -p 

M ST JS33 JS36 M ST JS3-! 

2S0 * 250 

iSO ^ ^ 150 

HH'^' 100 
^^^^^ 
^^^^^ 

/5 75 W 



SO 
37 



Fig. 7. Western blot analysis of media and cell lysates from transiently transfected 
HEK293 cells. JS33A1-43 (33) or JS36A1-41 (36) cDNA constructs were used to 
transfect 50% confluent (4-d-old) HEK293 cells. Media (A) or cell lysates (B) were 
obtained 48-li posttransfection, immunoprecipitated with 1 |Lig of anti-HA antibodies 
conjugated to 40 |id of protein A-Sepharose, separated on a 7.5% SDS/PAGE gel, and 
blotted onto PVDF membranes. The membranes were probed with a monoclonal anti-HA 
antibody because the recombinant JS33 and JS36 constructs contained an HA-epitope 
tag. Cells transfected with recombinant, histidine-tagged a-2,6-sialyltransferase (ST) 
lacking the HA epitope were used as a negative control. The positions of molecular 
weight markers (M) in kDa are shown on the left. The Western blot analyses show that 
the transiently transfected HEK293 cells expressed the recombinant enzymes in the 
media as expected. The estimated molecular weights of the recombinant JS33 (86 kDa) 
and JS36 (88 kDa) in both the media and cell lysates were much higher than those 
calculated from their amino acid sequences (73.2 kDa and 80.7 kDa for JS33 and JS36, 
respectively), suggesting that both proteins were posttranslationally modified in the 
HEK293 cells. JS36 was consistently expressed at lower levels in the fransiently 
transfected and stably transformed HEK293 cells than JS33. 
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Fig. 8. Western blots of the GAUTl -depleted fractions (A) and GAUTl -enriched 
fractions (B) from the anti-GAUTl immunoabsorption of GalAT activity from SP- 

Sepharose purified solubilized Arabidopsis membrane proteins described in Fig. \B (see 
text). Equal proportions of depleted fraclions and anli-GAUTl-conjugated magnetic 
beads were analyzed by immunoblotling with GAUTl antiserum. Similar fractions 
obtained by using 160 |u,l of preimmune serum were used as controls. Arrows indicate the 
«60 kDa protein band detected by GAUTl antiserum. IgG, IgG heavy chain detected by 
the secondary antibody used in Western blotting. Immunodepletion of GalAT activity 
from the SP-Sepharose fraction was accompanied by the disappearance of the 60-kDa 
protein band (^4). Likewise, there was a dose-dependent appearance of both GalAT 
activity and the 60-kDa band in anti-GAUTl immunoabsorbed fractions (5), suggesting 
that the 60-kDa band may represent a form of GAUTl . However, several other bands 
also appear to specifically react with the GAUTl antiserum {B; compare 
immunoabsorbed and preimmune fractions) and the identity of these proteins remains to 
be determined. GAUTl encodes a predicted protein of 77.4 kDa; thus, the consistent 



recognition and immunoprecipitation by GAUTl antiserum of the 60-kDa protein in 
Arabidopsis fractions enriched for GalAT activity, and the correlation of the presence of 
the 60-kDa band with GalAT activity, could indicate that the 60-kDa protein is a 
proteolytically cleaved or otherwise processed version of GAUTl. The structure of the 
protein is under investigation. 



Supporting Figure 9 

Fig. 9. Sequence alignment of the 25 GAUTl -related superfamily proteins and other 

members of Arabidopsis glycosyltransferase family 8. The result of a multiple sequence 
alignment of 37 of 40 Arabidopsis family-8 proteins carried out by using CLUSTALX is 
shown. Three proteins (At2g35710, At4gl6600, and At5gl8480) were not included in the 
alignment because we were unable to find alignment parameters that would align the 
DxD motifs present in these three proteins with the corresponding motif in the other 37 
family-8 proteins. The symbols below the sequence alignments are those defined in the 
CLUSTALX output. *, single conserved amino acid residue; :, fiiUy conserved "strong" 
groups of amino acid residues; •, fiiUy conserved "weak" groups of amino acids. The first 
25 proteins listed arc members of the GAUTl -related superfamily. The first 15 are 
GAUT proteins and the next 1 0 arc GATL proteins. GAUTl (At3g61 130) is in bold type. 
The remaining proteins listed in italics are other members of Arabidopsis 
glycosyltransferase family 8. None of the 15 members of family 8 that do not belong to 
the GAUTl -related gene family share significant sequence alignment with the GAUTl- 
related gene family proteins. 



Table 1, Partial purification of enriched galacturonosyltransferase-containing 
Arabidopsis protein fraction used for tandem MS 



Step 


Sample 


Total activity, fimol'min ' 


Yield, % 


1 


i Solubilized membrane proteins* 


57.2 


100 


2 


SP-Scpharosc 


37.5 


65.6 


3 


Reactive yellow 3 


17.5 


30.6 


4 


UDP-agarose (1)^ 


7.09 


12.4 


5 


UDP-agarose (2)* 


1.35 


2.4^^ 



* Specific activity, 26 pmol»min~^»mg~\ 
^Specific activity, 121 pmol»min~^»mg~\ 

•''Because of high detergent and low protein levels, protein concentration not determined. 
^Fraction used for tandem MS. 



Table 2. List of proteins identified in the partially purified 
galacturonosyltransferase-containing ft-action recovered after repetitive UDP- 
agarose chromatography 



i GenBank accession no. 


Predicted 
molecular mass, kDa 


Proposed function* 


Mowsc score' 


i BAB01930 


65.9 


P-fructofuranosidase 


282 


AAD05539 


102 


a-xylosidase precursor 


236 


AAL15216 


42.0 


putative methionyl-tRNA synthetase 


168 


i BAB40450 


76..6 


long-chain acyl-CoA synthetase 


112 


: AAD29817 


61.7 


synaptotagmin A 


100 


JS33/NP_565893 


69.7 


putative glycosyltransferase^ 


110 


BAB02117 


48.5 


unknown 


81 


BAB 10278 


35.7 


unknown 


77 


JS36/NP_191672 


77.4 


putative glycosyltransf erase 


73 


BAA20519 


63.3 


L-ascorbate oxidase 


73 


CAB78227 


97.4 


putative phospholipase D 


71 



*The proposed fLinctions were previously inferred by electronic annotation. 
^Mowse scores >60 are significant based on an a priori a of 0.05. 



^Proteins representing putative glycosyltransferases are in bold type. 



Table 3. The Arabidopsis GAUTl -related gene superfamily 



Gene* 


GcnBank 
accession 
no.^ 


Prcdiclcd 
molecular 
mass, kDa 


Amino acid 
identity/similarity''' 


SA versus 
SP^ 


Cladc 


GAUTI/JS36/LGTI 


Al3g6 1 1 30 


77.3 


100/100 


SA 


GAUT-A 


GAUT2/LGT2 


At2g46480 


62.1 


65/78 


none 


GAUT-A 


\ GAUT3 


At4g38270 


77.8 


68/84 


SP 


GAUT-A 


I GAUT4/JS36L/LGT3 


At5g47780 


71.1 


66/83 


SP 


GAUT-A 


i GAUT5/LGT5 


At2g30575 


69.9 


45/67 


SP 


GAUT-A 


GAUT6/LGT9 


Atlg06780 


67.5 


46/64 


SA 


GAUT-A 


: GAUT7/JS33/LGT7 


At2g38650 


69.7 


36/59 


SA 


GAUT-A 


\ GAUT8/QUA1 


At3g25140 


64.4 


58/77 


SA 


GAUT-B 


\ GAUT9 


At3g02350 


64.2 


57/76 


SA 


GAUT-R 


I GAUT10/LGT4 


At2g20810 


61.8 


50/72 


SA 


GAUT-B 


i GAUTll 


Atlgl8580 


61.9 


51/71 


SA 


GAUT-B 


: GAUT12/LGT6/IRX8 


At5g54690 


60.9 


40/61 


SA 


GAUT-C 


\ GAUT13 


At3g01040 


61.1 


43/62 


SA 


GAUT-C 


\ GAUT14 


At5gl5470 


65.3 


43/62 


SA 


GAUT-C 


I GAUT15 


At3g58790 


60.6 


37/56 


SP/SA 


GAUT-C 


GATLl/PARVUS/GLZl 


Atlgl9300 


39.0 


29/49 


SP 


GATL 


; GATL2 


At3g50760 


32.5 


27/52 


SP 


GATL 



i GATL3 


Atlgl3250 


39.9 


23/43 


SP 


GATL 


: GATL4 


At3g06260 


40.3 


29/51 


SP 


GATL 


i GATL5 


Atlg0272() 


41.2 


25/44 


SP 


GATL 


I GATL6/LGT10 


At4g02130 
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*The name given to each member of the GAUTl -related gene family includes its 
designation within the LGT family [Tavares, R., Aubourg, S., Lechamy, A. & Kreis, M. 
(2000) Plant Mol. Biol. 42, 703-717] or the names of any characterized ^ra^/Jop^w gene 
mutants (1-4). The numbering of the GAUT and GATL genes is based on the 
phylogenetic analysis of the family (see Fig. 3B). 



^From The Arabidopsis Information Resource database or National Center for 
Biotechnology Information. 

•''Sequence identity/similarity is compared to 397 amino acids of GAUTl starting at 
amino acid position 277. 

^The presence of either a putative transmembrane domain (SA; signal anchor) or a signal 
peptide (SP) sequence at the N-terminus of each gene was determined using SignalP 
version 3.0 (www.cbs.dtu.dk/services/SignalP) and confirmed using TMpred 
(www.ch.embnet.org/software/TMPRED_form.html). 
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Table 4. Expression analysis of the GAUTl-related superfamily 
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^Presence or absence of MPSS signatures for specific galacturonosyltransferase 
superfamily genes according to the Arabidopsis MPSS database 
(http://mpss.dbi.udel.edu). 

^Presence or absence of transcripts determined by whole genome array (1). 

•'■Presence or absence of transcripts in GENEVESTIGATOR database (2) determined by 
whole genome array. 

^NA, gene not available in database. 

1. Yamada, K., Lim, J., Dale, J. M., Chen, H., Shinn, P., Palm, C. J., Southwick, A. M., 
Wu, H. C, Kim, C, Nguyen, M., et al. (2003) Science 302, 842-846. 

2. Zimmermann, P., Hirsch-Hoffinann, M., Hennig, L. & Grruissem, W. (2004) Plant 
Physiol. 136, 2621-2632. 



Supporting Materials and Methods 

Chemicals and Kits. Uridine diphosphate-a-D-[^''C]galactose (UDP-D^^^CJOal/?, 11. 0 
GBq/mmol), PD-10 colvimns, and SP-, protein A- and protein G-Sepharose were 
purchased fi-om Amersham Pharmacia Biosciences. A mixture of oligogalacturonides 
with a degree of polymerization (DP) of 7-23 was prepared according to refs. 1 and 3. 
UDP-agarose was purchased fi-om Calbiochem (La JoUa, CA). Nickel-nitrilotriacetic acid 
agarose (Ni-NTA) resin, plasmid maxiprep, and gel extraction kits were purchased from 
Qiagen (Valencia, CA). EZNA plasmid miniprep kits were purchased from Omega Bio- 
tek (Doraville, GA). Restriction enzymes were purchased from New England Biolabs 
(Beverly, MA). Highly purified, monoclonal anti-hemagglutinin (anti-HA) antibodies 
were purchased from Covance (Berkeley, CA). Anti-HA conjugated to horseradish 
peroxidase was purchased from Roche. The 3,3',5,5'-tetramethylbenzidine (TMB) 
substrate kits were purchased from Vector Labs (Burlingame, CA). PVDF membranes 
were purchased from Schleicher & Schuell. Protein concentrations were determined by 



the Bradford assay (Bio-Rad) using BSA as the standard. Reactive yellow 3 resin and all 
other chemicals were purchased from Sigma. 

Plant Material. Arabidopsis thaliana (cv. Columbia) cell-suspension cultures derived 
from leaf calli were grown in the dark for 7-10 days in Gamborg's B-5 basal medium 
containing 58.4 mM sucrose, 9 |uM 2,4-dichlorophenoxyacetic acid, and 0.23 |uM kinetin 
as described in ref 2. 

Mammalian Cell Culture. Human embryonic kidney (HEK) 293 cells (Edge 
Biosystems; Gaithersburg, MD) were a gift of Kelley Moremen (University of Georgia, 
Athens). HEK293 cells were grown in 150-cm culture dishes (Coming) in bicarbonate- 
buffered, Dulbecco's modified Eagle's medium (Sigma), pH 7.4, containing glucose (4.5 
g/Iiter) and supplemented with 10% (vol/vol) FBS (Sigma), 0.6 g/liter L-glutamine, 100 
|u,g/ml streptomycin sulfate, and 100 units/ml penicillin. Cells were maintained at 37°C in 
a humidified incubator (Thermo Electron, Newington, NH) containing 5% CO2 and 
subcultured when the cells were 80% confluent. 

Preparation of SP-Sepharose-Purified Arabidopsis Membrane Proteins. A detergent- 
solubilized membrane fraction from suspension-cultured Arabidopsis cells containing 
GalAT activity was prepared as described in ref. 2. The solubilizcd GalAT fraction (125 
ml) was loaded twice onto a 50-ml column of SP-Sepharose cation exchange resin (5 x 
2.5 cm) equilibrated with buffer A [50 mM Hepes, pH 7.3, 0.25 mM MnCl2, 1% (vol/vol) 
Triton X-100, 2 mM EDTA, and 25% (vol/vol) glycerol]. The column was washed with 3 
volumes of buffer B [50 mM Hepes, pH 7.3, 0.25 mM MnCb, 25% (vol/vol) glycerol, 
2.5% (wVvol) 3-[(3-cholamidopropyl)dimethylammonio]propanesulfonic acid CHAPS)]. 
Bound proteins were eluted by using a step gradient of NaCl in buffer B as follows: 50 
ml of 100 mM, 50 ml of 200 mM, 35 ml of 300 mM, and 70 ml of 400 mM. Proteins 
eluting with 300 and 400 mM NaCl were pooled and desalted in buffer B over PD-10 
columns according to the manufacturer's instructions. All operations were carried out at 
4°C. 

Reactive Yellow 3 Chromatography. The pooled and desalted fraction from the SP- 
Sepharose column was loaded onto a 15 ml column (2x5 cm) of Reactive yellow 3 
(RY3) equilibrated with buffer B. Bound proteins were eluted by the sequential addition 
of one colvmin volvmie of buffer B containing (z) 10 |uM UDP, (zz) 10 |uM UDP and 2 |uM 
OGAs, (zzz) 10 |LiM UDP, 2 |iiM OGAs and 100 mM NaCl, and (zv) 10 |uM UDP, 2 |uM 
OGAs, and 200 mM NaCl. The proteins that eluted with 100 and 200 mM NaCl were 
pooled and desalted in buffer B using PD-10 columns. 

UDP-Agarose Chromatography. The pooled RY3 fractions were loaded onto a 7-ml 
(2.1 X 2 cm) UDP-agarose column equilibrated with buffer B. Bound proteins were 
sequentially eluted with one column volume each of 0.01, 0.05, 0.1, 0.5, 1, 5, 10, 15, 20 
mM UDP and 1 M NaCl in buffer B. All of the eluted fractions were desalted and tested 
for GalAT activity. The fraction that eluted with 1 M NaCl and contained the greatest 
amovmt of GalAT activity was reapplied to the UDP-agarose column (Table 1) and bound 
proteins were eluted with one column volume of buffer B containing 0.01 mg/ml OGAs 



and (0 50 mM UDP, (//) 25 mM MnCla, (iii) 25 mM MnClj and 0.5 M NaCl, (iv) 25 mM 
EDTA, and (v) 25 mM EDTA and 0.5 M NaCl. OGAs were added to the elution buffers 
to inhibit the activity of a contaminating exopolygalacturonase-like activity that 
copurified with GalAT activity. The fraction that eluted with 25 mM MnCb and 0.5 M 
NaCl was most enriched for GalAT activity and selected for further analysis by treatment 
with trypsin, followed by liquid chromotography tandem MS (LC-MS/MS). 

Trypsin Digestion and LC-MS/MS Analysis of Arabidopsis Peptides. The most 
purified fraction from UDP-agarose chromatography was treated for 12 h at 37°C with 
940 units of sequencing grade, modified trypsin (Promega). Resuhing peptides were 
dialyzed overnight against water by using Spectro/Por CE 1,000 molecular-weight-cutoff 
dialysis membranes (Spectrum Laboratories, Rancho Dominguez, CA), lyophilized, and 
analyzed by LC MS/MS. The peptides were introduced into a Q-T0F2 (Waters 
Micromass, Milford, MA) tandem mass spectrometer using a Waters CapLC delivery 
system with gradient mobile phases of aqueous 0.1% (vol/vol) formic acid (A) and 
acetonitrile (B) and gradient conditions of 10% B to 70%) B over 60 min at a flow rate of 
1 |ul/min. The Q-T0F2 mass spectrometer was operated in a data-dependent scan mode 
and survey MS spectra, acquired from 450-1,700 mass units, used to identify peptide 
sequences. The switch criteria for MS to MS/MS mode were ion count and charge state, 
with the Q-TOF set to ignore singly charged ions and to acquire MS/MS data for up to 
three coeluting peptides. Collision energy was varied automatically depending on the 
peptide mass and charge state. The sequences were probed against the Arabidopsis 
genome, and proteins were identified by using the Mascot search engine 
(www.matrixscience.com) and a peptide and MS/MS tolerance of ±2 and ±1, 
respectively. 

RT-PCR. Total RNA from flowers, rosette leaves, or stems obtained from 3- or 6-week- 
old A. thaliana ecotype Columbia plants as well as total RNA from roots of 4-week-old 
Arabidopsis plants grown in liquid media were gifts of Maor Bar-Peled (University of 
Georgia) (4). First strand cDNA synthesis was conducted according to the manufacturer's 
instructions by using 1 mM oligo(dT) primer, 200 units of Superscript II reverse 
transcriptase (Invitrogen), and 8 |iig of total RNA. Products from these reactions were 
treated with 2 units of RNase H (Invitrogen) and used as a template for PCR using gene 
specific primers. PCR reactions were conducted using high fidelity Platinum Taq 
polymerase (Invitrogen) according to manufacturer's instructions. Typical PCR reactions 
used 1 cycle at 95°C for 2 min, followed by 30 cycles at 95°C for 30 s, 52°C for 30s, and 
70°C for 2.5 min and a final extension at 70°C for 5 min. The PCR products were 
analyzed on Tris-acetate EDTA (TAE) \% (wt/vol) agarose gels, and DNA bands of the 
appropriate size were excised and purified by using a QIAquick gel extraction kit 
(Qiagen). Control PCR reactions included actin primers ACTl 19S and ACT284A (5) as a 
positive control [a gift from Richard Meagher (University of Georgia)]. 

Transcript analysis of JS33 (GAUT7) and JS36 (GAUTl) in Arabidopsis tissues was 
conducted by RT-PCR using the primers FJS33Ntru and RJS33 (5'- 
aagcattggccgtgtagctacttac-3') and FJS36Ntru and RJS36 for JS33 and JS36, respectively. 



Expression of JS33 and JS36 in HEK293 Cells. Purified PCR products for truncated 
JS33 and JS36 were cloned into pCR2.1-T0P0 (Invitrogen) and Smal/NotI fragments 
were subcloned into the mammalian expression vector pEAKlO (Edge Biosystems) along 
with an N-terminal Hindlll/Smal fragment containing a Trypanosoma cruzi mannosidase 
(TCMss) signal sequence (MRLLTALFAYFIVALILAFSVSAKARR), a polyhistidine 
tag (HHHHHHHHHH), and two copies of the hemagglutinin (HA) epitope 
(YPYDVPDYA) (6). Gene constructs in the pEAKlO vector were confirmed by DNA 
sequencing. 

Transient transfection and stable transformation of HEK293 cells (7) were carried out by 
using 25-30 |ug of DNA. Transiently transfected cell lines were incubated at 37°C for 48 
h before harvesting. Stably transformed cell lines were selected on media containing 

puromycin(l |iig/ml). 

Stably transformed cell lines were confirmed by PCR amplification of genomic DNA 
and/or RT-PCR of total RNA using the gene specific primers FJS33Ntru and RJS33, and 
FJS36Ntru, RJS36, and 36IP (5'-tgattctgcagtccttggtaaatacag-3') for JS33 and JS36, 
respectively. Cells transformed with empty pEAKlO vector or expressing a recombinant, 
histidine-tagged, rat a-(2,6)-sialyltransferase (8), a gift of Kelley Moremen (University of 
Georgia), were used as negative controls. Control RT-PCR reactions conducted to test 
transcript levels in HEK293 cells included the use of primers (FP 5'-caatgaccccttcattgacc- 
3', RP 5'-gtcttctgggtggcagtgat-3') specific to positions 123 and 584, respectively, of 
human glyceraldehyde 3'-phosphate dehydrogenase (accession no. BC023632) (gifts 
from Michael Pierce, University of Georgia). 

Ni^'^-Affinity Column Chromatography. A recombinant rat a-(2,6)-sialyltransferase 
containing an N-terminal polyhistidine tag but no hemagglutinin epitope tag (gift of 
Kelley Moremen) was purified for use as a negative control for Western blot analysis of 
recombinant GAUTl (JS36). Media from an HEK293 cell line expressing the 
recombinant a-(2,6)-sialyltransferase was desalted over a PD-10 column equilibrated 
with buffer C (50 mM KH2PO4, pH 7.5, 300 mM NaCl, and 40 mM imidazole). The 
desalted media (30 ml) was poured over a 200-|ul column of Ni-NTA agarose equilibrated 
with buffer C and the column was washed with 10 volumes of buffer C. Proteins bound 
to the Ni-NTA resin were denatured in 0.2 volumes of SDS/PAGE sample buffer (9) and 
separated by SDS/PAGE. 

Immunoprecipitation of Recombinant JS33 and JS36 Using Anti-HA Antibodies. 

Protein A-Sepharose, equilibrated in ice-cold PBS/137 mM NaCl, 2.7 mM KCl, 4.3 mM 
Na2HP04, and 1.4 mM KH2PO4, pH 7.3), was incubated with 1 |ug of purified 
monoclonal anti-HA antibody (Covance, Princeton, NJ) at a ratio of 30: 1 (vol/vol) for 2 h 
unless otherwise specified. The resulting protein A-Sepharose/anti-HA conjugate was 
washed tree times with 10 volumes of cold PBS, brought to a 50% slurry with PBS 
(media) or cell-lysis buffer (cell lysates), and kept at 4°C until use. 

Media (3.75 ml) from JS33- and JS36-expressing cell lines and vector alone cell lines, 
was harvested, brought to 10% (vol/vol) glycerol, and incubated with a 60-|li1 slurry of 



protein A-Sepharose/anti-HA conjugate in the presence of 3 10 |u,l of PBS and 80 mg of 
BSA for 1 h. Proteins bound to the protein A-Sepharose/anti-HA conjugate were washed 
four times with 10 volumes of 0.1% (vol/vol) TX-lOO, 50 mM Tris'Cl, pH 7.4, and 300 
mM NaCl followed by 2 washes with 10 volumes of PBS and used immediately for 
galacturonosyltransferase activity assays (10 |li1) or SDS/PAGE (10 |li1). 

Cell lysates from HEK293 Hues were prepared by incubating one 150-cm dish of cells 
with 1 ml of cold cell-lysis buffer [50 mM Hepes, pH 7.3, 1% (vol/vol) TX-lOO, 150 mM 
NaCl, 2 mM EDTA, and 1 tablet of Complete Protease Inhibitor Mixture (EDTA-free; 
Roche)] for 30 min. Cell debris was pelleted by centrifugation, and the clear supernatant 
was incubated with 40 |ul (for Western blots) or 160 |ul (for galacturonosyltransferase 
activity assays) of protein A-Sepharose/anti-HA conjugate for 1 h. Immunoprecipitates 
were washed either five times with 10 volumes of buffer A (for activity assays) or three 
times with 10 volumes of buffer A, followed by 2 washes of 10 volumes of PBS (for 
Western blot). All manipulations were done at 4°C. 

Production of Polyclonal Antisera. The criteria used to select the peptide sequence 
selected for MAP production and antibody production included uniqueness to GAUTl 

(determined from sequence alignments using CLUSTALX, version 1.83) (10), low 
hydrophobicity (hydropathy plots) (1 1), high P-turn character (12) determined by using 
PROTSCALE (http://au.expasy.org/cgi-bin/protscale.pl) and significant loop character 
[determined using GENTHREADER (http://bioinf.cs.ucl.ac.uk/psipred)]. 

Magnetic Bead Immunoabsorption of GalAT Activity. A solubilized Arabidopsis 
membrane protein preparation (see above) containing GalAT activity (20 ml) was loaded 
onto a 5-ml column of SP-Sepharose equilibrated with buffer A [50 niM Hepes, pH 7.3, 
0.25 mM MnClz, 2 mM EDTA, 25% (vol/vol) glycerol, 1% (vol/vol) Triton X-100]. The 
column was washed with 30 ml of buffer A, and bound proteins were eluted by using a 
step gradient of NaCl in buffer A (5 ml each of 100 mM, 200 mM, 300 mM, and 400 mM 
NaCl). The last 2.5 ml of the 300 mM NaCl fi-action was pooled with the 400 mM NaCl 
fraction and desalted over a PD-10 column according to the manufacturer's instructions. 
This pooled, desalted fraction (referred to as the Arabidopsis SP-Sepharose fraction) was 
used for all immunoprecipitation experiments. 

M-280 sheep anti-rabbit IgG-coupled Dynabeads (Dynal Biotech, Lake Success, NY) 
were washed with PBS (pH -7.3, 137 mM NaCl, 2.7 mM KCl, 4.3 mM Na2HP04, and 
1 .4 mM KH2PO4), resuspended in PBS (6-7 x 10^ beads per ml), and incubated with 
undiluted anti-GAUTl antiserum at a ratio of 3:1 (vol/vol) for 2 h at 4°C according to the 
manufacturer's instructions. The conjugate was washed three times in an excess volume 
of PBS and another three times in an excess volvime of buffer A by using a magnetic 
particle concentrator (Dynal MPC-S; Dynal Biotech). 

Immimoprecipitation was performed by mixing Arabidopsis SP-Sepharose fraction with 
increasing amoimts of the Dynabead/anti-GAUTl antisera conjugate (representing an 
equivalent of 0 to 160 |li1 of antiserum per 240 |li1 of SP-Sepharose fraction). The 
conjugate:SP-Sepharose fraction mixture was rotated for 2 h at 4°C, and 



immunoprecipitates were washed either five times with an excess volume of buffer A 
(activity assays) or three times with buffer A, followed by two washes with PBS 
(Western blots). 

GalAT assays of the SP-Sepharose fraction, the anti-GAUTl -depleted supematants, and 
the anti-GAUTl immunoabsorbed proteins (resuspended in buffer A) were done by using 
a filter assay (13). GalAT reactions containing 30 |li1 of enzyme, 50 mM Hepes, pH 7.3, 
0.2 M sucrose, 0.05% (wt/vol) BSA, 25 mM KCl, 90 |ug of OGAs (DP 7-23), 6.9 |uM 
UDP-D-[^''C]Gal/7A (specific activity 196 mCi'mmol"^), and 0.125 mM MnCb in a total 
reaction volume of 60 |ul were incubated for 2 h at 30°C. Reactions were terminated by 
addition of 10 |li1 of 0.4 M NaOH. 

SDS/PAGE and Immunoblotting. SDS-PAGE was carried out according to a method 
described in ref 14 using 7.5% or 10% SDS/PAGE gels. Proteins were visualized by 
silver staining (15), Coomassie staining (Bio-Safe Coomassie G250 stain, Bio-Rad), or 
transferred to PVDF membranes for 2 h at 200 rtiA in 25 mM Tris, 192 mM glycine, and 
20% (vol/vol) methanol. Membranes were blocked for 1 h in 3% BSA in Tris-buffered- 
saline [TBS; 100 mM Tris'Cl, pH 7.5, and 0.9% (wt/vol) NaCl] and incubated with a 
1:1,000 dilution of anti-HA-peroxidase antibody in blocking buffer for 1 h. Membranes 
were washed four times in TTBS [0.1% (vol/vol) Tween 20 in TBS] and developed by 
using a 3,3',5,5'-tetramethylbenzidine (TMB) substrate kit (Vector Labs, Burlingame, 
CA) according to manufacturer's instructions. 

Synthesis of UDP-D-["C]GalpA. UDP-D-[^''C]GalpA was synthesized based on 
methods described in refs. 16 and 17 with some modifications. Briefly, 50 |iiCi of UDP-d- 
[^''CJGal/? was purified over a CarboPac PA-1 column (Dionex; Sunnyvale, CA) by using 
a 0.05-1 M linear gradient of ammonium formate, pH 6.6, at a flow rate of 1 ml/min to 
ensure removal of contaminants that sometimes inhibit conversion of UDP-Gal to UDP- 
GalA. Purified UDP-D-['''C]Gal/? was lyophilized and resuspended in 0.5 ml of 25 mM 
NaH2P04, pH 7.0, containing 250 units of galactose oxidase and 13,000 units of catalase. 
The reaction was incubated in the dark at room temperature for 24 h, after which another 
250 units of galactose oxidase and 13,000 units of catalase were added. The reaction was 
incubated for another 24 h in the dark and terminated with 600 f-il of 5:1 (vol/vol), 
chloroform/methanol. The resulting mixture containing UDP-D-[^''C]Gal/>A was 
extracted and purified as described in ref 16. The typical yield of UDP-D-[^''C]Gal/?A 
from the starting UDP-D-[^V]Gal/? was between 60% and 70%. 

Galacturonosyltransferase Filter Activity Assays. The galacturonosyltransferase 
activity assay used for determination of incorporation of radioactive [^''CJgalacturonic 
acid from UDP-D-['''C]GalA was a modification of the procedure of ref 1 as described in 
ref 13. Protein fractions (10 |li1) were incubated in a total volume of 30 |ul for 15 min at 
30°C in reaction buffer [50 mM Hepes, pH 7.8, 0.2 M sucrose, 0.05% (wt/vol) BSA, and 
25 mM KCl] containing 80 |ig of oligogalacturonides (DP of 7-23), 1.1 |uM UDP-D- 
[^^C]Gal/7A (specific activity 298 mCi'mmol"^), 0.25 mM MnCl2, and 140 ^M UDP-D- 
Gal/>A. Assays of immimoprecipitated media from HEK293 cells were as above, except 



that the reaction contained 2 UDP-D-[^''C]Gal/7A and 100 [iM UDP-Gal/?A. 
Reactions were terminated by the addition of 5 |ul of 0.4 M NaOH. 

Terminated reaction mixtures were spotted onto cetylpyridinium chloride-coated 
Whatman 3 -MM paper (4 cm^) as described in ref 13. The filters were allowed to dry for 

5 min and were washed three times for 10 min each in 150 mM NaCl at a ratio of 1 filter 
per 100 ml of solution. The amount of radioactivity bound to each filter was estimated by 
scintillation counting. Statistical analysis of galacturonosyltransferase reaction data were 
performed by using a two-sample, one-tailed Student's t test (18) with an a priori a of 

0. 05 (95% confidence). 

Analysis of Conserved Amino Acid Motifs. The sequence analysis program PRATT 2.1 
(http://ca.expasy.org/tools/pratt) was used to identify conserved amino acid sequence 
patterns among the proposed 25-memher Arabidopsis GAUTl -related superfamily and to 
distinguish the subgroups of that family: the 15 GAUT and \0 GATL genes. Conserved 
protein sequences were scanned against SWISSPROT, TrEMBL, and BDP by using 
SCANPROSITE, http://ca.expasy.org/tools/scanprosite. The presence of GAUTl and other 
family members in the CAZy glycosyltransferase family 8 was determined by using the 
Carbohydrate Active Enzymes database at http://afmb.cnrs-mrs.fr/CAZY. 

Sequence Alignment of GAUT Family Genes and Phylogenetic Analysis. Arabidopsis 
genes with high sequence identity to GAUTl were aligned by using CLUSTALX (19) and 
parameters suggested by Hall (20) and phylogenetic analysis was done by Bayesian 
analysis employing MRBAYES (21). Phylogenetic analyses that included EST (GenBank, 
http://www.ncbi.nlm.nih.gov/dbEST) or tentative consensus (TC, 
http://www.tigr.org/tdb/tgi) sequences from soybean (Glycine max), barrel medic 
(Medicago truncatula), rice (Oryza sativa), tobacco {Nicotiana tabacum), and chickpea 
{Cicer arietinum) was conducted by using the program PAUP, version 3.1 (22). 
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At3g61130 MALKRGLSGVNRIRGSGGGSRSVLVLLIFFCVFAPLCFFVGRGVYIDSSNDYSIVSVKQN 60 

At2g46480 

At4g3827 0 MSTICSHRELKAYRPLQDNNLQEVYASSAAAVHYDPDLKLLSQDVNIVATYSDHYG 5 6 

At5g477 8 0 MVKLRNLVLFFMLLTVVAHILLYTDPAASFKTPFSKRDFLEDVTALTFNSDENR 54 

At2g3057 5 MNQVRRWQRILILSLLLLSVLAPIVFVSNRLKSITSVDRGEFIEELSDITDKTEDEL 57 

Atlg0 67 8 0 MKQIRRWQRILILALLSISVFAPLIFVSNRLKSITPVGRREFIEELSKIR-FTTNDL 5 5 

At2g3B650 MKGGGGGGGGGGGGKRRWKVLVIGVLVLVILSMLVPLAFLLGLHNGFH 48 



At3g25140 
At3g02350 
At2g20810 
AtlglB580 
At5g54690 
At3g01040 
At5gl5470 
At3g58790 
Atlgl9300 
At3g50760 
Atlgl3250 
At3g06260 
Atlg02720 
At4g02130 
At3g62660 
Atlg24170 
Atlg70090 
At3g28340 



Atlg54940 MG 2 

Atlg08990 

At4g33330 MTIMTMI 7 

Atlg77130 MIPS 4 

At3gl8660 MANSPAAPAPTT 12 



Atlg56600 
Atlg09350 
At2g47180 
Atlg60450 
Atlg60470 
At4g26250 
At5g23790 
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At5g23790 



LDWRERLAMQSVRSLFSKEILDVIATSTADLGPLSLDSFKKNNLSASWRGTGVDPSFRHS : 

MTDACCLKGNEDKMVPRFGHGTWIGKAFN — : 

NIRLGRVKMGDLSPSWVLENPAYQVSRKTKGSQLVIPRDSFQNDTGMEDNASHSTTNQTD : 

LNLLPRE SPAVLRGGLVGAVYSDKNSRRLDQLSARVLSATD- 

R LTAIEQDE-EGLKEPKRILQ-DRDFNSVVLSNSSD 

R LSAIEHEDGEGLKGPRLILFKDGEFNSSAES : 

SPGFVTVQPASSFESFTRINATKHTQRDVSERVDEVLQKINPVLPKKSDINVGSRDVNAT : 

MANHHRLLRGGGSPAII- : 

MAVAFRGGRGGVGSGQS- : 

MRRRGGDSFRR- : 

MRRWPVDHRR- ! 



-MRSGRRPQGRRIAIRNETETELRSRIGEKSGSEKRNL 37 



TKTHNSRGKIFMIYLILVSLSLLGLILP- 



MKMAPSKSALIRFNLVLLGFSFLLYTAI FFHPSSS 42 

SSPMESRHRLSFSNEKTSRRRFQRIEKG VKFNT 37 

TTGGDSRRRLSASIEAICKRRFRRNSKGGGRSDMVKPFNIINFSTQDKNSSCCCFTKFQI 72 



At3g61130 

At2g46480 
At4g38270 
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ENPATPDVKSNNLNEKRDSISKDSIHQKVETPTKIHRRQLREKRREMRANELVQHNDDTI 180 

DTPEMLHERSLRQEKRLERANELMN — DDSL 58 

ES ENQFPNVDFAS PAKLKRQ I LRQERRGQRT LE L IRQEKETD 158 

DDTHSHTDISIKQVTHDAASDSHINRENMHV 12 5 

KS NDTVQSNEGDQKNFLSEVDKGNNHKPKEE-QAVSQKTTVSSNAE 13 6 

DGGNTYKNREEQVIVSQKMTVSSDEK 114 

SG TDSKKRGLPVSPTVVANPSPANKTKSEASYTG 142 

QQR ITLTAFASTIALFLFTLSFFFASDS 45 

TGLRSFFSYRIFISALFSFLFLATFSVVLNS 4 8 

AGRRKISNVVWWVLS--GIALLLFFLILSKA 40 

RGRRRLSSWIWFLLGSFSVAGLVLFIVQHYH 41 

MQLHISPS-LRHVTVVT GKG 19 



IG SDVGMQLHISPS-MRSITISS SNE 62 

MKFYISATGIKKVTISNPGVGIGKGSG 27 



-LFRITSPSSTLRIDLPSPQVNKNPKWLRLIRNYL 67 



VYFSSGASFVGCSFRDCTPKVVRGVKMQELVEENEINKKDLLTASNQTKLEAPSFMEEIL 102 
LKLVLICIMLGALFTIYRFRYPPLQIPEIPTSFGLTTDPRYVATAEINWNHMSNLVEKHV 97 
VKLLLFILLSATLFTIIYS--PEAYHHSLSHSSSRRQDPRYFSDLDINWDDVTKTLENIE 130 
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LKLENAAIERSKSVDSAVLGKYSIWRRENENDNS-DSNIRLMRDQVIMARVYSGIAK 23 6 

QKLETAAMARSRSVDSAPLGNYTIWKNEYRRGKSFEDMLRLMQDQIIMARVYSGLAK 115 

EQMQEAAIQKSMSFENSVIGKYSIWRRDYESPNA-DAILKLMRDQIIMAKAYANIAK 214 

QLTQQTSEKVDEQPEPNAFG AKKDTGNVLMPDAQVRHLKDQLIRAKVYLSLPS 17 9 

VKISARDIQLNHKTEFRPPSSKSEKNTRVQLERATDERVKEIRDKIIQAKAYLNLAL 193 

GQILPTVNQLANKTDFKPPLSKGEKNTRVQPDRATDVKTKEIRDKIIQAKAYLNFAP 171 

VQRKIVSGDETWRTCEVKYGSYCLWREENKEPMKDAKVKQMKDQLFVARAYYPSIAK 199 

NDSPDLLLPGVEYSNGVGSRRSMLDIKSDPLKPRLIQIRKQADDHRSLALAYASYARKLK 105 
SRHQPHQDHTLPSMGNAYMQRTFLALQSDPLKTRLDLIHKQAIDHLTLVNAYAAYARKLK 108 

GHIEPRPSIPKRRYRNDKFVEG-MNMTEEMLSP--TSVARQVNDQIALAKAFVVIAK 94 

HQQDPSQLLLERDTRTEMVSPPHLNFTEEVTSA--SSFSRQLAEQMTLAKAYVFIAK 96 

LREFIKVKVGSRRFSYQMVFYSLLFFTFLLRFVFVLSTVDTIDGDPSPCSSLACLGKRLK 7 9 

MKIKVAARHISYRTLFHTILILAFLLPFVFILTAVVTLEG-VNKCSSFDCFGRRLG 55 

FIDLMKIKVAARHISYRTLFHTILILAFLLPFVFILTAVVTLEG-VNKCS SIGRRIG 118 

GCAAAAAALAARRFSSRTLLLLLLLLAIVLPFIFVRFAFLVLESASVCDSPLDCMGLRLF 87 



PE--KRIQVGFLNIDEKERESYEARGPL VLKNIHVPLDHIPKNVTWKSLYPEWIN 12 0 



TRGLGKTKIGMVNMEECDLTNWKRYG ETVHIHFERVSKLFKWQDLFPEWID 153 

FGRSEYQGIGLINLNDNEIDRFKEVTKS DCDHVALHLDYAAKNITWESLYPEWID 152 

EGRT IGVLNFDSNEIQRWREVSKSKDNGDEEKVVVLNLDYADKNVTWDALYPEWID 18 6 
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-LKNKNDLLQELQARLKDSQ-- 

-FTNNLALHQEIETQLMK 

-SKNVTNLYVFLMQQCGENK-- 
-AKANAHFVRELRLRIKEVQ-- 



-RVLGEATSDADLPRSAHEKLRAMGQVLA 2 83 

-LAWEEESTDIDQEQRVLDSIRDMGQILA 160 

-RVIGKATSDADLPSSALDQAKAMGHALS 2 61 

-RALADASKDSDLPKTAIEKLKAMEQTLA 22 6 



— PGNNSQIVKELRVRTKELER ATGDTTKDKYLPKSSPNRLKAMEVALY 24 0 

--PGSNSQVVKELRGRLKELER SVGDATKDKDLSKGALRRVKPMENVLY 218 

--MPSQSKLTRDMKQNIQEFER ILSESSQDADLPPQVDKKLQKMEAVIA 24 6 

LENSKLVRIFADLSRNYTDLINKPTYRALYDSDGASIEESVLRQFEKEVKERIKMTRQVI 165 
LDASKQLKLFEDLAINFSDLQSKPGLKSAVSDNGNALEEDSFRQLEKEVKDKVKTARMMI 168 

--ESKNLQFAWDLSAQIRNSQ LLLSSAATRRSPLTVLESESTIRD MAVLLY 143 

--EHNNLHLAWELSSKIRSCQ LLLSKAAMRGQPISFDEAKPIITG LSALIY 145 

PKLLGRRVDSGNVPEAMY QVLEQPLSEQELKGRSDIPQTLQDFMS 124 

PRLLGRIDDSEQRLVRDF YKILNEVSTQEIPDGLKLPESFSQLVS 100 

PRLLGRVDDSE-RLARDF YKILNEVSTQEIPDGLKLPNSFSQLVS 162 

R GGDTSLKIGEELT RALVEETTDHQDVNGRGTKGSLESFDD 128 

MSQHLLLLILLSL 13 

MHSKFILYLSI 11 

MSS LRLRLCL 10 

MASRSLSYTQLLGLLSFI 18 

MHWITRFSAFFSAALAM 17 

MLWITRFAGLFSAAMAV 17 

MRFSGLFSAALVI 13 

MSSRFSLTVVC 11 

MRLRFPMKSAVLAFAIF 17 

MMSGSRLASRLIIIFSI 17 

EEASTCPEIPLPQPEGSDANVDVIVARVPCDGWSANKGLRDVFRLQVNLAAANLAV 17 6 

MEGSEADVDVVWKVPCDGFSEKRGLRDVFRLQVNLAAANLVV 4 3 

EEEETEVPTCPEIPMPDFESLEK-LDLVWKLPCN-YPEEGWRREVLRLQVNLVAANLAA 211 
EVEEFEVPTCPSLPLIQ-IPGKPRIDLVIAKLPCD--KSGKWSRDVARLHLQLAAARVAA 20 9 
EEQETEVPVCPNIPNIK-VPTR-RLDLIWKLPCR--KEGNWSRDVGRLHLQLAAATVAA 242 
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kakmqlydcklvtgklramlqtadeqvrslkkqstflaqlaaktipnpihilsmrltidy 34 3 
raheqlyecklvtnklramlqtvedeleneqtyitfltqlaskalpdaihIltmrlnley 22 0 
lakdelydchelakkfrailqsterkvdglkkkgtfliqlaaktfpkplhIlslqlaady 321 
kgkqiqddcstvvkklramlhsadeqlrvhkkqtmfltqltaktipkglhIlplrlttdy 28 5 
kvsrafhncpaiatklqamtykteeqaraqkkqaaylmqlaarttpkglhilsmrlttey 30 0 
kasrvfnncpaiatklramnynteeqvqaqknqaaylmqlaarttpkglh|lsmrltsey 27 8 
kaksfpvdcnnvdkklrqildltedeasfhmkqsvflyqlavqtmpkslh|lsmrltveh 30 6 
aeakesfdnqlkiqklkdtifavneqltnakkqgafssliaaksipkglhIlamrlmeer 225 
veskesydtqlkiqklkdtifavqeqltkakkngavaslisaksvpkslh|lamrlvger 22 8 
qaqqlhydsatmimrlkasiqaleeqmssvseksskygqiaaeevpksly|lgvrlttew 203 
kaqdahydiattmmtmkshiqaleeranaatvqttifgqlvaealpkslhcltikltsdw 205 
evkrsksdarefaqklkemvtlmeqrtrtakiqeylyrhvasssipkqlh|lalklaneh 184 
dmknnhydaktfalvframvekferdlreskfaelmnkhfaassipkgihIlslrltdey 160 
dmknnhydaktfalvlrammekferdmreskfaelmnkhfaassipkgihIlslrltdey 222 
lvkemtlkrrdirafasvtkkmmerkvqsakhhelvywhlashgipkslhIlslrlteey 188 

lllhkpisatt iiqkfkeapqfynsadIpliddsesd 50 

lavftvsfag gerfkeapkffnspeIltiended- 45 

llllpitiscvtv tltdlp afreapafrngrecskttwi psd 52 

lllvttttmavrvgvilhkpsaptlp vfreapafrngdqcg 5 9 

illspslqsfspaaairsshpyadefkpqqns--dyssfrespmfrnaeqcrssgedsgv 75 

ivlspslqsfppaaairssp--spifr kapavfnngdeglssg gv 60 

ivlspslqsfppaeairsshldaylrfpssdppphrfsfrkapvfrnaad|aaadidsgv 73 
liallpfvvgirliparitsvgdggggggnngfsklgpfmeapeyrngke|vsssvnren 71 

lvfiplfsvgirmipgrltavsatvgng fdlgsfveapeyrngke|vsqslnren 72 

istsfftvesirlfpdsfddass dlmeapayqngldIsvlaknrll 63 

qsglrtvnqavyvvfigscgpmheifpcdervmrvedywvykpylprlkqkllmpvgscq 23 6 
esgrrnvdrtvyvvfigscgpmheifrcdervkrvgdywvyrpdltrlkqkllmppgscq 103 
kkgktdwrwkskvlfwskcqpmieifrcddlekreadwwlyrpevvrlqqrlslpvgscn 271 
ss — kglhnvhvilvsdcfpipnlftgqelvarqgniwlykpnlhqlrqklqlpvgsce 2 66 
SA — kgffrghvffvsrcfpipnlfrckdlvsrrgdvwlykpnldtlrdklqlpvgsce 2 99 



MAPEMNN-- 7 

MAPGLTQTA 9 

MTPETHVD- 8 

MAPEISVN- 8 

-maqmsmtveks- 11 
mtmtvekr- 8 



At3g61130 YLLSP EKRKFPRSENLENPNLYHYALFSDNVLAASVVVNSTIMNAKDPSKHVF|lV 399 

At2g4 64 8 0 HLLPL PMRNFPRRENLENPKLYHYALFSDNVLAASVVVNSTVMNAQDPSRHVf|lV 27 6 

At4g3827 0 FILGFNEEDAVKEDVSQKKLEDPSLYHYAIFSDNVLATSVVVNSTVLNAKEPQRHVf|iV 381 

At5g477 8 0 YALNS SEQQFPNQEKLEDTQLYHYALFSDNVLATSVVVNSTITNAKHPLKHVf|iV 34 2 

At2g3057 5 FTLDH EKRQLL-QQSYNDPDLYHYVVFSDNVLASSVVVNSTISSSKEPDKIVF|vV 355 

Atlg0 67 8 0 FSLDP EKRQMPNQQNYFDANFNHYVVFSDNVLASSVVVNSTISSSKEPERIVf|vV 334 

At2g38 650 FKSDS LEDPISEKFSDPSLLHFVIISDNILASSVVINSTVVHARDSKNFVf|vL 3 60 

At3g25140 I AIIP- -EKYTDEGKDRPRELEDPNLYHYAIFSDNVIAASVVVNSAVKNAKEPWKHVfIvV 2 83 

At3g02350 ISNP--EKYKDAPPDP--AAEDPTLYHYAIFSDNVIAVSVVVRSVVMNAEEPWKHVFivV 2 84 

At2g20810 FQNLDLQRTLKERSRVDSKLTDNSLYHFCVFSDNI IATSVVVNSTALNSKAPEKVVF|lV 2 63 

AtlgiBSBO vtep-srheladenrnsprlvdnnlyhfcifsdnviatsvvvnstvsnadhpkqlvf|iv 2 64 

At5g54 690 SINAAAR-LQLPEAELVPMLVDNNYFHFVLASDNILAASVVAKSLVQNALRPHKIVl|iI 243 

At3gOi040 ssnahar-rqlpspellpvlsdnayhhfvlatdnilaasvvvssavqssskpekivfIvi 219 

At5gl5470 SSNAHAR-RQLPSPEFLPVLSDNAYHHFILSTDNILAASVVVSSAVQSSSKPEKIVf|iI 281 

At3g587 90 SVNAMAR-MRLPPPESVSRLTDPSFHHIVLLTDNVLAASVVISSTVQNAVNPEKFVF|IV 247 

Atlgl9300 DDVVAKP IFCSRRAVHVAMTLDAAYIRGSVAAVLSVLQHSSCPENIVf|fV 101 

At3g507 60 FVCSDKAIHVAMTLDTAYLRGSMAVILSVLQHSSCPQNIVf|fV 8 9 

Atlgl3250 HEHNPSIIHIAMTLDAIYLRGSVAGVFSVLQIIASCPENIVFHFI 96 

At3g06260 TREADQIHIAMTLDTNYLRGTMAAVLSLLQIISTCPENLSFHFL 102 

Atlg02720 CN PNLVHVAITLDIDYLRGSIAAVNSILQHSMCPQSVFFIfL 117 

At4g02130 CN PSLVHVAITLDVEYLRGSIAAVNSILQHSVCPESVFf|fI 102 

At3g62660 CN PSLVHVAITLDFEYLRGSIAAVHSILKHSSCPESVFf|fL 115 

Atlg2417 0 -FVSSSS SSNDPSLVHIAMTLDSEYLRGSIAAVHSVLRHASCPENVFf|fI 121 

Atig70090 -Fvss cdaslvhvamtldseylrgsiaavhsmlrhascpenvffIli 118 

At3g28340 -LACD PSAVHIAMTLDPAYLRGTVSAVHSILKHTSCPENIFfIfI 107 

Atlg54940 IAPSFAQ-FGQEAWRPKHEDNLASKAVTALPRRLRVAYVTVLHSSEAYVCGAIALAQSIR 2 95 

Atlg08990 IAP--LG-QGEAWIQDKNRNLTSEKTTLSSFTAQRVAYVTLLHSSEVYVCGAIALAQSIR 160 

At4g33330 LALP LWAPQGVDKVYDLTKIEAETKRPKREAYVTVLHSSESYVCGAITLAQSLL 325 

Atlg77130 LSVP LQAKDNFYSAGAKKEAYATILHSAQFYVCGAIAAAQSIR 309 

At3gl8660 LSLP LGIQDRPSLGNPKREAYATILHSAHVYVCGAIAAAQSIR 342 

Atlg56600 KLTVPVHSATGGEKRAYVTFLAGTGDYVKGVVGLAKGLR 4 6 

Atlg09350 KLSY GEKKRAYVTFLAGTGDYVKGVVGLAKGLR 40 

At2g47180 DAMS TVTITKPSLPSVQDSDRAYVTFLAGNGDYVKGVVGLAKGLR 54 

Atlg60450 MINASEKAPKE RAYVTFLAGNGDYVKGVVGLAKGLR 44 

Atlg60470 PMYLSEKAHQAPPRRAYVTFLAGNGDYVKGVVGLAKGLR 47 

At4g26250 IKADVTVSHDRVKRAYVTFLAGNKDYWMGVVGLAKGLR 4 9 

At5g23790 IEADVTVSHEGVERAYVTFLAGNKDYWMLVVGLAKGLR 4 6 
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TDKLNFGAMNMWFLLN- 
TDKLNFGAMSMWFLLN- 
TDKLNFGAMKMWFRIN- 
TDRLNYAAMRMWFLDN- 
TDSLNYPAISMWFLLN- 



-PPGKATIHVENVDEFKWLNSSYCPVLRQLESAAMREYYFKA- 
-PPGEATIHVQRFEDFTWLNSSYSPVLSQLESAAMKKFYFKT- 
-APADATIQVENINDFKWLNSSYCSVLRQLESARLKEYYFKA- 
-PPGKATIQVQNVEEFTWLNSSYSPVLKQLSSRSMIDYYFRA- 
-PSGRASIQILNIDEMNVLPLYHAELLMKQNSS 



456 
333 
438 
399 
403 
382 



TDSLNYPAISMWFLLN--IQSKATIQILNIDDMDVLPRDYDQLLMKQNSN 

TDEQNYFAMKQWFIRN--PCKQSTVQVLNIEKLELDDSDMKLSLSAEFRVSFPSGDLLAS 418 

TDKMNLGAMQVMFKLK-EYK-GAHVEVKAVEDYTFLNSSYVPVLKQLESANLQKFYFE-- 33 9 

TDRMNLAAMKVWFKMR-PLDRGAHVEIKSVEDFKFLNSSYAPVLRQLESAKLQKFYFE-- 341 

TNEINYAAMKAWFAINMDNLRGVTVEVQKFEDFSWLNASYVPVLKQLQDSDTQSYYFS-- 321 

TNRVSYKAMQAWFLS--NDFKGSAIEIRSVEEFSWLNASYSPVVKQLLDTDARAYYFG-- 320 

TDRKTYFPMQAWFSLH--PLSPAIIEVKALHHFDWLSKGKVPVLEAMEKDQRVRSQFRGG 301 

TDKKTYAGMHSWFALN--SVAPAIVEVKSVHQFDWLTRENVPVLEAVESHNSIRNYYHGN 277 

TDKKTYAGMHSWFALN--SVAPAIVEVKGVHQFDWLTRENVPVLEAVESHNGVRDYYHGN 339 

TDKKTYTPMHAWFAIN--SASSPVVEVKGLHQYDWPQEVNFKVREMLDIHRLIWRRHYQN 305 



ASASADASS--LRATIS-- 
TSK--QSHR--LQNYVV-- 

ATHRRSAD LRRIIS-- 

SLPHFEND LFTSIK-- 

VSSESQN LESLIR-- 

AVSEETNL LESLVR-- 

VS ETD LESLIR-- 

AAEFDSASPRVLSQLVR-- 
AAEFDPASPRVLSQLVR-- 

ASGTSQGS LAKTLS-- 

QSGSHKDMILLHDHTIT-- 

QSGSTKDMILLHDDSIT NISLIGLSLAGWKLRRVERIRSP- 

QTNTKRDLILLHDDSIS ITKLRALAAAGWKLRRIIRIRNP- 



SSFPYLD-FTVYVFNVSSVSRLISS 140 

ASFPYLK-FRIYPYDVAAISGLIST 126 

STFPYLT-YHIYHFDPNLVRSKISS 134 

STFPYLN-FKIYQFDPNLVRSKISK 140 

STFPKLTNLKIYYFAPETVQSLISS 155 

SVFPRLK-FNIYDFAPETVRGLISS 140 

STFPELK-LKVYYFDPEIVRTLIST 150 

STFPSLN-FKVYIFREDTVINLISS 162 

STFPSLN-FKVYIFREDTVINLISS 159 

SVFPSLS-FKVYTFEETTVKNLISS 145 

■ — NKSLIGLSAAGWNLRLIDRIRSP 335 

200 

365 



MSGSTRDLVILVDETIS-' 
QSGSTRDLVILVDDNIS-' 
KAKSKYPLVVAVLPDVP- 
KTKSKYPLVVAVLPDVP- 



■-EYHKSGLVAAGWKIQMFQRIRNP-- 

■ - GYHRS GLE AAGWQ IRTIQRIRNP-- 

■ - E DHRKQLVDQGC VVKE I E PVY P P - ■ 
■-ADHRRQLLDQGCVIKEIQPVYPP-- 



349 
382 



KVKSAYPLVVAMLPDVP EEHRRILVDQGCIVREIEPVYPP-- 

KVKSAYPLVVAMLPDVP EEHREILRSQGCIVREIEPVHPP-- 

KVKSAYPLVVAMLPDVP EEHREILRSQGCVVREIEPVYPP-- 

KVKSAYPLVVAILPDVP EEHRQILLAQGCIIREIEPVYPP-- 

KVKSAYPLVVATLPDVP EEHRQILVDQGCIIRDIEPVYPP-- 



At3g61130 DHP TSGSSNLKYRNPKYLSMLNHLRFYLPEVYP-KLNKILFLiD|lIVQKi 50 6 

At2g4 64 8 0 ARSESVESGSENLKYRYPKYMSMLNHLRFYIPRIFP-KLEKILFviD|vVVQKi 38 6 

At4g38270 NHPS S I SAGADNLKYRNPKYLSMLNHLRFYLPEVYP-KLEKI LFlIdIi VVQkI 4 91 

At5g477 8 0 HHTN SDTNLKFRNPKYLSILNHLRFYLPEIFP-KLSKVLFlIdIiVVQkI 44 8 

At2g30575 DPRIISALNHARFYLPDIFP-GLNKIVLFIhIvVVQRI 440 

Atlg0 67 8 0 DPRFISTLNHARFYLPDIFP-GLNKMVLL|H|vVVQRi 419 

At2g38 650 QQNRTHYLSLFSQSHYLLPKLFD-KLEKVVIL|D|vVVQRi 458 

At3g2514 0 NKLENATKDTTNMKFRNPKYLSILNHLRFYLPEMYP-KLHRILFLiDlvVVQKi 3 92 

At3g02350 NQAENATKDSHNLKFKNPKYLSMLNHLRFYLPEMYP-KLNKILFlIdIvVVQkI 3 94 

At2g20B10 GHN-DDGRTPI--KFRNPKYLSMLNHLRFYIPEVFP-ALKKVVFLiD|vVVQKi 371 

Atigi8580 eqtsqdtisep--kvrnpkylsllnhlrfyipeiyp-qlekivfl|d|vvvqr| 371 

At5g54 690 SSVIVANNKENPVVVAAKLQALSPKYNSLMNHIRIHLPELFP-SLNKVVFL|d|iVIQt| 360 

At3gOi040 hiaganlsettprtfasklqsrspkyisllnhlriylpelfp-nldkvvflIdIiviqkI 33 6 

At5gl5470 HVAGANLTETTPRTFASKLQSRSPKYISLLNHLRIYIPELFP-NLDKVVFLiDllVVQGl 3 98 

At3g587 90 LKDSDFSFVEG--THEQSLQALNPSCLALLNHLRIYIPKLFP-DLNKIVLLDDDVVVQS:D; 3 62 

Atlgl9300 SIRSALDCPLNYARSYLADLLPPCVRRVVYLDSDLILVD;! 180 

At3g50760 SIRSALDSPLNYARNYLADILPTCLSRVVYLDSDLILVD| 166 

Atlgl3250 SIRRALDQPLNYARIYLADLLPIAVRRVIYFDSDLVVVdI 174 

At3g06260 SIRQALDQPLNYARIYLADIIPSSVDRIIYLDSDLVVVd| 180 

Atlg02720 SVRQALEQPLNYARNYLADLLEPCVKRVIYLDSDLVVVDi 195 

At4g02130 SVRQALEQPLNYARSYLADLLEPCVNRVIYl|s|lVVVD| 180 

At3g62660 SVRQALEQPLNYARNYLADLLEPCVRRVIYLis|LIVVD| 190 

Atlg24170 SIRLALENPLNYARNYLGDILDRSVERVIYLisiviTVDi 202 

Atlg70090 SIRQALENPLNYARNYLGDILDPCVDRVIYLDSBlIVVDi 199 

At3g28340 SIRQALDSPLNYARSYLSEILSSCVSRVIYLDSgVIVVDi 185 

Atlg54940 FSQ KDSYNEWNYSKLRVWQVTDYDKLVFIDaIfIILKK 373 

Atlg08990 FSK KRSYNEWNYSKLRVWQVTDYDKLVFIDAbFI IVKN 238 

At4g33330 LAE KDSYNEYNYSKrRLWQLTDYDKVIFIpADIIVLRN 403 

Atlg77130 NAV PNAYNEWNYSKFRLWQLTEYSKI IFiIaImLILRN 387 

At3gl8660 KAE KDAYNEWNYSKFRLWQLTDYDKIIFIi|AiLLILRN 420 

Atlg56600 ENQTEFAMAYYVINYSKLRIWEFVEYNKMIYliiG|lQVFDN 127 

Atig09350 dnqtqfamayyvlnysklriwkfveyskliylIgIiqvfen 121 

At2g47180 ENQTQFAMAYYVINYSKLRIWKFVEYSKMIYlIgIiQVYEN 135 

Atlg60450 DSQDAYARAYYIINYSKLRIWNFEEYNKMIYl|a|iQVFGN 125 

Atlg60470 DNQVEFAMAYYVLNYSKLRIWNFEEYSKMIYlIaIiQVFDN 128 

At4g26250 ENKTGYSMAYYVINYSKLRIWEFVEYEKMIYL|g|iQVFSN 130 

At5g23790 ENTTGYSMAYYVINYSKLRIWEFVEYEKMIYLIgIiQVFKN 127 



At3g61130 LTPiiWEVNLNGKVNGAVETCGE SFHRFDKYLNFSNPHIARNFNPN ACGWAY 

At2g4 64 8 0 LTpIwSIDLKGKVN ENFDPK FCGWAY 

At4g3827 0 LAP|WEIDMQGKVNGAVETCKE SFHRFDKYLNFSNPKI SENFDAG ACGWAF 

At5g477 8 0 LSG|wSVDLKGNVNGAVETCGE SFHRFDRYLNFSNPLISKNFDPR ACGWAY 

At2g3057 5 LTR|wSLDMTGKVVGAVETCLEGD — PSYRSMDSFINFSDAWVSQKFDPK ACTWAF 

Atlg0 67 8 0 LSr|wSIDMKGKVVGAVETCLEGE--SSFRSMSTFINFSDTWVAGKFSPR ACTWAF 

At2g38 650 LSpIwDLDMEGKVNGAVKSCTVR LGQLRSLKRGNFDTN ACLWMS 

At3g2514 0 LTG|wEIDMDGKVNGAVETCFG SFHRYAQYMNFSHPLIKEKFNPK ACAWAY 

At3g02350 VTG|wKINLDGKVNGAVETCFG SFHRYGQYLNFSHPLIKENFNPS ACAWAF 

At2g20810 LSS|fSIDLNKNVNGAVETCME TFHRYHKYLNYSHPLIRSHFDPD ACGWAF 

Atlgl8580 LTp|fSLDLHGNVNGAVETCLE AFHRYYKYLNFSNPLISSKFDPQ ACGWAF 

At5g54 690 LSp|wDIDMNGKVNGAVETCRGEDKFVMSKKFKSYLNFSNPTIAKNFNPE ECAWAY 

At3g01040 LSp|wDIDLNGKVNGAVETCRGEDVWVMSKRLRNYFNFSHPLIAKHLDPE ECAWAY 

At5gl5470 LTP|wDVDLGGKVNGAVETCRGEDEWVMSKRLRNYFNFSHPLIAKHLDPE ECAWAY 

At3g587 90 LSSlwETDLNGKVVGAVVDSWCGDNCCPGRKYKDYFNFSHPLISSNLVQE DCAWLS 

Atlgl9300 IAK|aATDLGRDSVLAAPEYCNAN FTSYFTSTFWSNPTLSLTFADR KACYFNT 

At3g507 60 ISKIfSTHIPTDVVLAAPEYCNAN FTTYFTPTFWSNPSLSITLSLNRRATPCYFNT 

Atlgl3250 VAKLWRI DLR-RHVVGAPEYCHAN FTNYFT S RFWS SQGYKS ALKDR KPCYFNT 

At3g0 62 60 lEKLWHVEME-GKVVAAPEYCHAN FTIIYFTRTFWSDPVLVKVLEGK RPCYFNT 

Atlg0272 0 IVKLWKTGLG-QRTIGAPEYCIIAN FTKYFTGGFWSDKRFNGTFKGR NPCYFNT 

At4g02130 lAKLWKTSLG-SRI IGAPEYCIIAN FTKYFTGGFWSEERFSGTFRGR KPCYFNT 

At3g62 6 60 lAKLWMTKLG-SKTIGAPEYCIIAN FTKYFTPAFWSDERFSGAFSGR KPCYFNT 

Atlg2417 0 ITKLWNTVLTGSRVIGAPEYCHAN FTQYFTSGFWSDPALPGLISGQ KPCYFNT 

Atlg7 0 0 90 ITKLWNTSLTGSRIIGAPEYCHAN FTKYFTSGFWSDPALPGFFSGR KPCYFNT 

At3g2 834 0 IQKLWKISLSGSRTIGAPEYCHAN FTKYFTDSFWSDQKLSSVFDSK TPCYFNT 

Atlg54940 LDH|FYYPQLSASGNDKVLFNSGIMVLEPSACMFKDLMEKSFKIESYNGG DQG--FL 

Atlg08990 IDY|fSYPQLSAAGNNKVLFNSGVMVLEPSACLFEDLMLKSFKIGSYNGG DQG--FL 

At4g33330 LDIJ|fHFPQMSATGNDVWIYNSGIMVIEPSNCTFTTIMSQRSEIVSYNGG DQG--YL 

Atlg77130 IDF|fEFPEISATGNNATLFNSGLMVVEPSNSTFQLLMDNINEVVSYNGG DQG--YL 

At3gl8660 IDF|fSMPEISATGNNGTLFNSGVMVIEPCNCTFQLLMEHINEIESYNGG DQG--YL 

Atlg56600 I DI||fDLPNGQFYAVMDCFCEKTWSHS PQYKI GYCQQCPDKVTWPEAKLG PKPPLYF 

Atlg09350 IDh|fDLPDGNFYAVKDCFCEKTWSHTPQYKIGYCQQCPDKVTWPESELG PKPPLYF 

At2g47180 IDhIfDLPDGYLYAVMDCFCEKTWSHTPQYKIRYCQQCPDKVQWPKAELG EPPALYF 

Atlg60450 IDDIjFDMQDGYLHGVLSCFCEKIWSYTPLYSIGYCQYCPEKVVWPAEMES APPSPYF 

Atlg60470 IDHiFDLSDAYFYAVMDCFCEKTWSHSLQYSIGYCQQCPEKVTWPEDMES PPPPLYF 

At4g26250 IDH|fDTPRGYLYAVKDCFCEISWSKTPQFKIGYCQQCPEKVTWPVESLG SPPPVYF 

At5g23790 IDHi|FDTPRGYLYAVKDCFCEVSWSKTPQYKIGYCQQSPEKVTWPVESLG APPPVYF 



557 

412 



473 
502 
443 
445 
422 
422 
416 
392 
454 
418 
233 
222 
226 
232 
247 
232 
242 
255 
252 
238 
428 
293 
458 
442 
475 
184 
178 
192 
182 
185 
187 
184 



At3g61130 iMNMFDLKEWKKRDITGIYHKWQNMNENR — TLWKLGTLPPGLI TFYGLTHPLNKAIKVL 615 

At2g4 64 8 0 |mNIFDLKEWKKNNITETYHFWQNLNENR — TLWKLGTLPPGLITFYNLTQPLQRkIhLL 47 0 

At4g3827 0 |mNMFDLKEWRKRNITGIYHYWQDLNEDR — TLWKLGSLPPGLITFYNLTYAMDRsIhVL 60 0 

At5g477 8 0 |mNVFDLDEWKRQNITEVYHRWQDLNQDR--ELWKLGTLPPGLITFWRRTYPLDRk|hIL 557 

At2g3057 5 ImNLFDLEEIrRQELTSVYLKYFDLGVKG — HLWKAGGLPVGWLTFFGQTFPLEKrInVG 552 

Atlg0 67 8 0 |MNLIDLEEiRIRKLTSTYIKYFNLGTKR--PLWKAGSLPIGWLTFYRQTLALDKRiHVM 531 

At2g38 650 IlNVVDLArIrALGVSETYQKYYKEMSSGD-ESSEAIALQASLLTFQDQVYALDDkIaLS 5 61 

At3g2514 0 |MNFFDLDA|RREKCTEEYHYWQNLNENR--ALWKLGTLPPGLITFYSTTKPLDKSiHVL 501 

At3g02350 |MNIFDLNA|RREKCTDQYHYWQNLNEDR--TLWKLGTLPPGLITFYSKTKSLDKSiHVL 503 

At2g20810 |mNVFDLVE|rKRNVTGI YHYWQEKNVDR--TLWKLGTLPPGLLTFYGLTEALEASiHIL 480 

AtlgiBSBO |mnvfdlia|rnanvtaryhywqdqnrer--tlwklgtlppgllsfygltepldrr|hvl 480 

At5g54 690 |mNVFDLAa|rRTNISSTYYHWLDENLKSDLSLWQLGTLPPGLIAFHGHVQTIDPf|hML 47 6 

At3gOi040 ImnifdlrtIrktniretyhswlkenlksnltmwklgtlppaliafkghvqpidssIhml 452 

At5gi5470 ImnifdlqaIrktniretyhswlrenlksnltmwklgtlppaliafkghvhiidssIhml 514 

At3g587 90 GMNVFDLKaIrQTNITEAYSTWLRLSVRSGLQLWQPGALPPTLLAFKGLTQSLEPsIhVA 478 

Atlgl9300 GVMVIDLSR|REGAYTSRIEEWMAMQKR--MRIYELGSLPPFLLVFAGLIKPVNHRiNQH 2 91 

At3g507 60 GVMVIELKKiREGDYTRKIIEWMELQKR--IRIYELGSLPPFLLVFAGNIAPVDHRiNQH 280 

Atlgl3250 GVMVIDLGKWRERRVTVKLETWMRIQKR--HRIYELGSLPPFLLVFAGDVEPVEHRK[nQH 2 84 

At3g0 62 60 GVMVVDVNKWRKGMYTQKVEEWMTIQKQ--KRIYHLGSLPPFLLIFAGDIKAVNHRWNQH 2 90 

Atlg0272 0 GVMVIDLKKWRQFRFTKRTEKWMEIQKT--ERIYELGSLPPFLLVFAGHVAPISHRWNQH 305 

At4g02i30 gvmvidlkkwrrggytkr:ekwme:qrr--eriyelgslppfllvfsghvapishrwnqh 2 90 

At3g62 6 60 gvmvmdlerwrrvgytev:ekwme:qks--driyelgslppfllvfagevapiehrwnqh 30 0 

Atlg24170 GVMVMDLVRWREGNYREKLEQWMQLQKK- -MRI ydlgslppfllvfagnveai DHRIa/NQH 313 

Atig7 0 0 90 gvmvmdlvrwregnyrekletwmq:qkk--kr:ydlgslppfllvfagnveaidhrwnqh 310 

At3g2 834 0 GVMVIDLERWREGDYTRKIENWMKIQKED-KRIYELGSLPPFLLVFGGDIEAIDHQiNQH 2 97 

Atlg54940 NEIFVWWHRLSKRVNTMKYFDE--KNHRRHD LPENVEGLHYLG-LKPWVCYRD 47 8 

Atlg08990 NEYFVWWHRLSKRLNTMKYFGDESRHDKARN LPENLEGIHYLG-LKPWRCYRD 345 

At4g33330 NEIFVWWHRLPRRVNFLKNFWSNTTKERNIKNNLFA-AEPPQVYAVHYLG-WKPWLCYRD 516 

Atlg77130 NEIFTWWHRIPKHMNFLKHFWEGDEPEIKKMKTSLFGADPPILYVLHYLGYNKPWLCFRD 502 

At3gl8660 NEVFTWWHRIPKHMNFLKHFWIGDEDDAKRKKTELFGAEPPVLYVLHYLG-MKPWLCYRD 534 

Atlg56600 NAGMFVYEPNLSTYHNLLETVKIVPPTLFAEQDFLNMYFKDIYKPIPPVYNLVLAMLWRH 244 

Atlg09350 NAGMFVYEPSLPTYYNLLETLKVVPPTPFAEQDFLNMYFKDIYKPIPPVYNLVLAMLWRH 238 

At2g47180 NAGMFLYEPNLETYEDLLRTLKITPPTPFAEQDFLNMYFKKIYKPIPLVYNLVLAMLWRH 252 

Atlg60450 NAGMFVFEPNPLTYESLLQTLQVTPPTPFAEQDFLNMFFGKVFKPVSPVYNLILSVLWRH 242 

Atlg60470 NAGMFVFEPSPLTYESLLQTLEITPPSPFAEQDFLNMFFEKVYKPIPLVYNLVLAMLWRH 245 

At4g26250 NAGMLVFEPNLLTYEDLLRVVQITTPTYFAEQDFLNEYFTDIYKPIPSTYNLVMAMLWRH 247 

At5g23790 NAGMLVFGPNLVTYEDLLRVVQITTPTYFAEQDFLNIYFRDIYKPIPSTYNLVMAMLWRH 244 



At3g61130 

At2g46480 
At4g38270 
At5g47780 
At2g30575 
Atlg06780 
At2g38650 
At3g25140 
At3g02350 
At2g20810 
Atlgl8580 
At5g54690 
At3g01040 
At5gl5470 
At3g58790 
Atlgl9300 
At3g50760 
Atlgl3250 
At3g06260 
Atlg02720 
At4g02130 
At3g62660 
Atlg24170 
Atlg70090 
At3g28340 
Atlg54940 
Atlg08990 
At4g33330 
Atlg77130 
At3gl8660 
Atlg56600 
Atlg09350 
At2g47180 
Atlg60450 
Atlg60470 
At4g26250 
At5g23790 



Ili-YNPSIDKKDIEN-- 
||i-YDKGIDVKKIER-- 
||i-YDPALNQTAIEN- 
Ili-YNPSVNQRDIER-- 
|i|-HESGLRASDIEQ-- 
III- RE S GVKAVD I EQ- ■ 
III- Y D Y Y I NAQA I KN - ■ 
lii-YNPSISMDEIRN-- 
Ili-YNPGVSMDEIRN-- 
Ili-YTN-VDARVIEK-- 
III- YDVN I DNRL I E T - ■ 
IIIyQE T T S - YADAE S - ■ 
lilYQ S KTN - LENAKK- ■ 
IIIyQ S KTN - 1 ENVKK- ■ 
IliSRS VKS PQE I LKS - ■ 



■-aav\^yninmi:i'wlelamskyrpywtkyikfdhpylrrcnlh 67 2 

-saviIynIhmIIwtemgiskyqpywtkytnfdhpyiftcrlf 52 7 

-aavviyn|nyi|wlglafakykpywskyveydnpylrrcdin 657 

■-aaviIynInlIIwleigipryrgfwskhvdyehvylrecnin 614 

■-AAVliYD|lMiiWLDIGIDKYKRYWNIHVPYHHPHLQRCNIH 60 9 
■-AAVl|YD|vMi|iLDIGKENYKRYWNIHVPYHHTYLQQCNLQ 58 8 

■ - AAI lIynInmIIIlELG I PNYKN YWRRHL S RE DRFL S DCNVN 618 
■-AAVviFN|NMi|iLDIAMNQFRPLWTKHVDYDLEFVQACNFG 558 

■ - AGV I HynInmIIIlD I AMNQ YKS LWTKYVDNEME FVQMCN F G 5 60 
■-GAVLiFNlNLiliLKIGIEKYKPLWERYVDYTSPFMQQCNFH 536 
■-AAVl|iYN|NMi|iLKLAIGRYKPFWLKFLNSSHPYLQDCVTA 537 
■-AAVV|iFN|RA|||LDIAFPHLRPLWAKYLDSSDRFIKSCHIR 533 
■-AAVl|YN|Qsi|iLEIGFEHLRPFWTKYVNYSNDFIKNCHIL 50 9 
-AAVliYNlQSiliLEIGFEHLRPFWTKYVNYSNDFIKNCHIL 571 
-ASVLiFS|PAi|iLEISNPEVRSLWYRYVNSSDIFVRKCKIM 536 

IliGDNFRGLCRDLHPGPVSLLiwsiKGiiikRLDAGRPCPLDALWAPYDLLQTPFALDS 351 
liiGDNFRGLCRDLHPGPVSLLiwsiKGiiiVRLDDGRPCPLDALWVPYDLLESRFDLIE 34 0 



GLGGDNLEGLCRNLHPGPVSLLHl 
GLGGDNFEGRCRTLHPGPISLLII! 
GLGGDNVRCSCRDLIISGPYSLLIIi 
GLGGDNVRGSCRDLIIP :,r^ LLII 
GLGGDNVRG.SCRDLIIPGPVSLLHi 



IKPWLRLDSRRPCPLDSLWAPYDLFRYSPLISD 344 
:KPWLRLDSRKPCIVDHLWAPYDLYRSSRHSLE 350 

■KPWLRLDSKLPCPLDTLWAPYDLYKHSH 3 61 

■KPWIRLDSKRPCPLDALWTPYDLYRHSH 34 6 

■KPW?RLDSRRPCPLDTLWAPYDLYGHYSR 357 

GLGGDNIRGSCRSLHPGPVSLLHWSGKGKPWVRLDEKRPCPLDHLWEPYDLYKHKIERAK 373 
IliGDNVRGSCRSLHKGPVSLLHWSGKGKPWVRLDEKRPCPLDHLWEPYDLYEHKIERAK 37 0 
iilGDNIVSSCRSLHPGPVSLIHWSGKGKEWVRLDDGKPCPIDYLWAPYDLHKSQRQYLQ 357 
YDCNWDISERRVFASDSVHEKWWKVYDKMSEQLKGYCGLNKNMEKRIEKWRRIAKNNSLP 538 
YDCNWDLKTRRVYASESVHARWWKVYDKMPKKLKGYCGLNLKMEKNVEKWRKMAKLNGFP 405 
YDCNYDVDEQLVYASDAAHVRWWKVHDSMDDALQKFCRLTKKRRTEINWERRKARLRGST 57 6 
YDCNWNVDIFQEFASDEAHKTWWRVHDAMPENLHKFCLLRSKQKAQLEWDRRQAEKGNYK 562 
YDCNFNSDIFVEFATDIAHRKWWMVHDAMPQELHQFCYLRSKQKAQLEYDRRQAEAANYA 594 
PENIELDQVKVVHYCAAGAKPWRFTGEEENMDREDIKMLVKKWWDIYNDESLDYKNVVIG 304 
PENIELNEAKVVHYCAAGAKPWRFTGQEGNMEREDIKMLVEKWWDIYNDESLDYKNFNVH 2 98 
PENVELGKVKVVHYCAAGSKPWRYTGKEANMEREDIKMLVKKWWDIYDDESLDYKKPVTV 312 
PGKVDLESVKVVHYCPPGSKPWRYTGEEPNMDREDVKMLIKKWWDIYNDESLDFKPKSPA 302 
PENVELEKVKVVHYCAAGSKPWRYTGEEANMDREDIKMLVDKWWDVYNDESLDFKSKIPA 305 
PEHIDLDQISVIHYCANGSKPWRFDETEEHMDREDIKMLVKKWWDIYEDSSLDYKNFVET 307 
PEHIDLDQISVVHYCANGSKPWKFDEAEEHMDREDIKMLVKKWWEIYEDSSLDYKNFVET 304 
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At5g47780 
At2g30575 
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At3g25140 
At3g02350 
At2g20810 
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At3g01040 
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At3g50760 
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Atlg77130 
At3gl8660 
Atlg56600 
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At5g23790 



DQSLLGFASLSELTDDSSFL 

DQSLFGFSSLSELTEDSSFF 

YNQELEIL 

DRHWEIEVRDPRKTNLLVQ 

ENHWK I RI KD PRKKNRL SQ 

DYHWKINVTDPRRRRSYLIG 

DGHWKIKIKDKRLKTCFEDFCFWESMLWHWGETNSTNNSSTTTTSSPPHKTALPSL 

DGHWKIRVKDPRFKICIDKLCNWKSMLRHWGESNWTDYESFVPTPPAITVDRRSSLPGHN 

DSHKKQQTLQQFIEALSEAGALQYVKAPSAA 

CGQKEDVHRKPKTLPQFFTDLSEADVLQCAKAPSAA 

V DTEVDLVNLKPFITALTEAGRLNYVTAPSAA 

D LEATVLESTIIASVTEAPLSYSPAAPSAA 

D AEETVTKSSILASVLEPEMTYFPAPSAA 

ES KLSPINATLASKESVGDVLISLAPSAA 

ES KLNPVTATLASKKLVGDVLTSLAPSAA 



673 
528 
658 
615 
610 
589 
619 
559 
561 
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510 
572 
537 

341 
345 
351 



393 
390 
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557 
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